“average local curvature of log-likelihood per observation” (i.e., expected squared score);
or, informally: “how quickly the data punish small parameter mistakes, per sample.”
“FIM tells you how sharply the model reacts to parameter nudges (geometry/curvature).”
Motivation
The covariance of the score function is called the Fisher information.
It gives us a sense of the uncertainty of our estimate, i.e. it measures how much information our data provide about the parameter $\theta$:
Fisher Information
The Fisher information quantifies how much information a random variable $X$ carries about an unknown parameter $\theta$ that describes its probability distribution. For a PDF $f(x; \theta)$, it is defined as:

$$I(\theta) = \mathbb{E}\!\left[\left(\frac{\partial}{\partial \theta} \log f(X; \theta)\right)^{2}\right] = -\,\mathbb{E}\!\left[\frac{\partial^{2}}{\partial \theta^{2}} \log f(X; \theta)\right]$$

where the expectation is taken with respect to $f(x; \theta)$.
Fisher information measures the curvature of the log-likelihood function around the true parameter value.
A higher Fisher information indicates:
- The distribution is more sensitive to changes in the parameter
- We can estimate $\theta$ more precisely from observations
- The likelihood function has a sharper peak around the true value
The first form of the definition shows that Fisher information is the variance of the score function: the score has expectation zero (proof below), so its expected square is exactly its variance.
The second form of the definition (using the second derivative) directly shows the connection to curvature, since the second derivative measures how quickly the slope changes: a more negative second derivative indicates a sharper peak in the log-likelihood.
Cramér-Rao bound: The variance of any unbiased estimator $\hat{\theta}$ is bounded below by the inverse of the Fisher information:

$$\operatorname{Var}(\hat{\theta}) \geq \frac{1}{I(\theta)}$$
→ Fisher information directly determines the best possible precision we can achieve when estimating parameters from data.
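One step worth making explicit (a standard fact, not spelled out above): for $n$ i.i.d. observations the Fisher information adds up, so the bound tightens linearly with the sample size:

$$I_{n}(\theta) = n\,I(\theta) \quad\Longrightarrow\quad \operatorname{Var}(\hat{\theta}) \geq \frac{1}{n\,I(\theta)}$$

This is what the informal "per sample" reading at the top refers to.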
For a normal distribution $\mathcal{N}(\mu, \sigma^{2})$, the Fisher information with respect to the mean $\mu$ is:

$$I(\mu) = \frac{1}{\sigma^{2}}$$
→ We can estimate the mean more precisely (higher Fisher information) when the variance is smaller.
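A minimal sketch to check this numerically, assuming NumPy; the variable names are mine, not from the original. It estimates $I(\mu)$ as the Monte Carlo variance of the score $(x - \mu)/\sigma^{2}$:

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 2.0, 1.5
n = 1_000_000

# Draw n samples from N(mu, sigma^2).
x = rng.normal(mu, sigma, size=n)

# Score of a single observation: d/dmu log f(x; mu) = (x - mu) / sigma^2.
score = (x - mu) / sigma**2

# Fisher information is the variance of the score (its mean is ~0).
print(f"Monte Carlo estimate of I(mu): {score.var():.4f}")
print(f"Closed form 1/sigma^2:         {1 / sigma**2:.4f}")
```

Consistent with the Cramér-Rao bound, the sample mean of $n$ such draws has variance $\sigma^{2}/n = 1/(n\,I(\mu))$, i.e. it attains the bound exactly.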
FIM … Fisher Information Matrix
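Spelled out for a vector parameter $\theta \in \mathbb{R}^{k}$ (the standard multivariate generalization of the scalar definition above), the FIM is the $k \times k$ matrix with entries

$$[\mathcal{I}(\theta)]_{ij} = \mathbb{E}\!\left[\frac{\partial \log f(X;\theta)}{\partial \theta_{i}} \, \frac{\partial \log f(X;\theta)}{\partial \theta_{j}}\right] = -\,\mathbb{E}\!\left[\frac{\partial^{2} \log f(X;\theta)}{\partial \theta_{i}\,\partial \theta_{j}}\right]$$

i.e. the covariance matrix of the score vector $\nabla_{\theta} \log f(X;\theta)$.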
Fisher information is the covariance of the score function
Proof + relation to Hessian
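A sketch of the standard argument, under the usual regularity conditions that allow differentiating under the integral sign. First, the score has mean zero:

$$\mathbb{E}\!\left[\frac{\partial}{\partial\theta}\log f(X;\theta)\right] = \int \frac{\partial_{\theta} f(x;\theta)}{f(x;\theta)}\, f(x;\theta)\,dx = \frac{\partial}{\partial\theta}\int f(x;\theta)\,dx = \frac{\partial}{\partial\theta}\,1 = 0$$

So the covariance of the score is its expected square, which is exactly the first form of $I(\theta)$. For the Hessian relation, differentiate the score once more:

$$\frac{\partial^{2}}{\partial\theta^{2}}\log f = \frac{\partial_{\theta}^{2} f}{f} - \left(\frac{\partial_{\theta} f}{f}\right)^{2}$$

Taking expectations, the first term integrates to $\partial_{\theta}^{2} \int f(x;\theta)\,dx = 0$, leaving

$$-\,\mathbb{E}\!\left[\frac{\partial^{2}}{\partial\theta^{2}}\log f(X;\theta)\right] = \mathbb{E}\!\left[\left(\frac{\partial}{\partial\theta}\log f(X;\theta)\right)^{2}\right] = I(\theta)$$

which is the second form of the definition and, in the multivariate case, says the FIM equals the negative expected Hessian of the log-likelihood.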