Covariance is just like correlation - it tells you how two random variables change together - with the difference that covariance is not necessarily bounded / normalized between $-1$ and $1$.

In fact, the covariance matrix is the unnormalized correlation matrix.

Covariance

The covariance between two variables $X$ and $Y$ is calculated as the average of the product of their deviations from their respective means $\mu_X$ and $\mu_Y$:

$$\operatorname{Cov}(X, Y) = \mathbb{E}\big[(X - \mu_X)(Y - \mu_Y)\big]$$

Expanding the product gives

$$\operatorname{Cov}(X, Y) = \mathbb{E}[XY] - \mathbb{E}[X]\,\mathbb{E}[Y]$$

This final version gives us a nice interpretation:
Covariance is the difference between the expected value of the product of two random variables under their joint behavior vs. as if they were independent. So if they are independent, both terms are the same and the covariance is 0.
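A quick numerical check that the two forms agree (a sketch with made-up data; NumPy assumed):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=100_000)
y = 0.5 * x + rng.normal(size=100_000)  # y partly depends on x

# Definition: average product of deviations from the means
cov_def = np.mean((x - x.mean()) * (y - y.mean()))

# Equivalent form: E[XY] - E[X]E[Y]
cov_alt = np.mean(x * y) - x.mean() * y.mean()

print(cov_def, cov_alt)  # both ≈ 0.5, identical up to floating point
```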

Covariance can be interpreted as:

$\operatorname{Cov}(X, Y) > 0$ → when $X$ increases, $Y$ tends to increase as well, and vice versa. It’s more likely to find $X$ and $Y$ both above or both below their means, i.e. you’re more likely to sample points where both are large or both are small.
$\operatorname{Cov}(X, Y) < 0$ → when $X$ increases, $Y$ tends to decrease, and vice versa. It’s less likely to find $X$ and $Y$ both above or both below their means, i.e. you’re more likely to sample points where one is large and the other is small.
$\operatorname{Cov}(X, Y) = 0$ → no linear relationship between $X$ and $Y$, i.e. knowing $X$ tells you nothing (linearly) about how large $Y$ will be.

The covariance of a random variable and itself is its variance: $\operatorname{Cov}(X, X) = \mathbb{E}\big[(X - \mu_X)^2\big] = \operatorname{Var}(X)$

Covariance is commutative: $\operatorname{Cov}(X, Y) = \operatorname{Cov}(Y, X)$
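Both properties are easy to verify numerically (a sketch with arbitrary data):

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=10_000)
y = rng.normal(size=10_000)

def cov(a, b):
    """Population covariance: average product of deviations from the means."""
    return np.mean((a - a.mean()) * (b - b.mean()))

# Cov(X, X) = Var(X)
assert np.isclose(cov(x, x), np.var(x))

# Cov(X, Y) = Cov(Y, X)
assert np.isclose(cov(x, y), cov(y, x))
```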

Limitations

Covariance doesn’t indicate the strength of the relationship, nor its causality. If two variables are independent, their covariance will be zero, but the reverse is not true, because covariance does not take non-linear relationships into account.
However, for a multivariate normal distribution, zero covariance does imply independence.
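A classic counterexample, sketched numerically: with $X$ symmetric around 0 and $Y = X^2$, the variables are completely dependent, yet their covariance is (near) zero:

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=1_000_000)  # symmetric around 0
y = x ** 2                      # fully determined by x, but non-linearly

cov_xy = np.mean((x - x.mean()) * (y - y.mean()))
print(cov_xy)  # ≈ 0 despite total dependence
```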

It is also sensitive to the scale of the variables, making it difficult to compare covariances across different datasets.

![[covariance-20240731122500640.webp|center|https://distill.pub/2019/visual-exploration-gaussian-processes/]]

Covariance Matrix

Split this + all the references into separate note…


The second moment of a random vector $\mathbf{x}$ is $\mathbb{E}[\mathbf{x}\mathbf{x}^\top]$ → the second moment matrix contains all pairwise products ($\mathbb{E}[x_i x_j]$).
When centered (mean subtracted), this becomes the covariance matrix (or “central second moment”):

$$\Sigma = \mathbb{E}\big[(\mathbf{x} - \boldsymbol{\mu})(\mathbf{x} - \boldsymbol{\mu})^\top\big]$$

The diagonal elements represent the variance of each variable, while the off-diagonal elements represent the covariance between pairs of variables:

$$\Sigma_{ij} = \operatorname{Cov}(x_i, x_j), \qquad \Sigma_{ii} = \operatorname{Var}(x_i)$$
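Assembling the matrix element-wise and comparing against `np.cov` (a sketch; `bias=True` matches the $1/n$ average used in the definition above):

```python
import numpy as np

rng = np.random.default_rng(3)
x1 = rng.normal(size=5_000)
x2 = 0.3 * x1 + rng.normal(size=5_000)

def cov(a, b):
    return np.mean((a - a.mean()) * (b - b.mean()))

sigma = np.array([
    [cov(x1, x1), cov(x1, x2)],   # diagonal entries are variances
    [cov(x2, x1), cov(x2, x2)],   # off-diagonal entries are covariances
])

assert np.allclose(sigma, np.cov(x1, x2, bias=True))
```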

The covariance matrix is always positive semidefinite: for any vector $\mathbf{v}$, $\mathbf{v}^\top \Sigma \mathbf{v} = \operatorname{Var}(\mathbf{v}^\top \mathbf{x}) \ge 0$.
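Numerically, this means the eigenvalues of any covariance matrix are non-negative (a quick NumPy sketch):

```python
import numpy as np

rng = np.random.default_rng(4)
data = rng.normal(size=(5, 1_000))   # 5 variables, 1000 samples
sigma = np.cov(data)                 # 5x5 covariance matrix

eigvals = np.linalg.eigvalsh(sigma)  # eigvalsh: eigenvalues of a symmetric matrix
assert np.all(eigvals >= -1e-10)     # all non-negative (up to floating point)
```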

See also multivariate gaussian distribution.

Illustrated with a simple example

How much someone likes some fruit, example table (higher score means they like the fruit more):

| Apple | Banana |
| ----- | ------ |
| 1     | 1      |
| 3     | 0      |
| -1    | -1     |

The means of both random variables:

$$\mu_{\text{Apple}} = \frac{1 + 3 - 1}{3} = 1, \qquad \mu_{\text{Banana}} = \frac{1 + 0 - 1}{3} = 0$$

The covariance matrix of these two random variables has the following form:

$$\Sigma = \begin{pmatrix} \operatorname{Var}(A) & \operatorname{Cov}(A, B) \\ \operatorname{Cov}(B, A) & \operatorname{Var}(B) \end{pmatrix} = \begin{pmatrix} 8/3 & 2/3 \\ 2/3 & 2/3 \end{pmatrix}$$

Since $\operatorname{Cov}(A, B) = \operatorname{Cov}(B, A)$, we know the elements of one triangle, so we can just mirror it.
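The fruit example end-to-end (a sketch; `bias=True` gives the population $1/n$ normalization, matching the average-of-deviation-products definition):

```python
import numpy as np

apple  = np.array([1, 3, -1])
banana = np.array([1, 0, -1])

print(apple.mean(), banana.mean())        # → 1.0 0.0

sigma = np.cov(apple, banana, bias=True)  # population (1/n) covariance matrix
print(sigma)                              # [[8/3, 2/3], [2/3, 2/3]]
```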

References

gpt
cov matrix yt