Score function
The term “score function” is a historical holdover from the context in which it was first used (genetic analysis).
The score function (or score) is the gradient of the log-likelihood function with respect to the parameter vector.
For a single observation, it measures how sensitive the log-likelihood is to small changes in the parameter at that point.
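In symbols (notation introduced here for concreteness), for a single observation $x$ with density $f(x; \theta)$, the score is

$$
s(\theta; x) \;=\; \nabla_\theta \log f(x; \theta).
$$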
In general, the score function tells us how we should adjust our parameters to increase the likelihood of observing our data.
A positive score suggests that increasing the parameter would improve the fit, while a negative score suggests decreasing it.
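As a minimal sketch (the model and numbers are chosen here for illustration, not taken from the text), consider a Bernoulli model with success probability p: the per-observation score is x/p - (1 - x)/(1 - p), and its sign tells us which way to move p.

```python
def bernoulli_score(p, x):
    """Score of a single Bernoulli observation x in {0, 1}:
    d/dp log f(x; p) = x/p - (1 - x)/(1 - p)."""
    return x / p - (1 - x) / (1 - p)

p = 0.3                       # hypothetical current guess for the parameter
print(bernoulli_score(p, 1))  # positive: observing a success favors a larger p
print(bernoulli_score(p, 0))  # negative: observing a failure favors a smaller p
```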
Properties of the score function
The score function has mean zero when evaluated at the true parameter value $\theta_0$ (the parameter that generated our observed data, not an estimate):

$$
\mathbb{E}_{\theta_0}\!\left[\, s(\theta_0; X) \,\right] = 0.
$$
This means that, when evaluated at the true parameter, the score exerts no systematic “push” in any direction: the fluctuations cancel out on average, because the expected log-likelihood is maximized at the true parameter. The variance of the score is called the Fisher information, $\mathcal{I}(\theta) = \operatorname{Var}_{\theta}\!\left[\, s(\theta; X) \,\right]$, which measures how much information a random variable carries about the parameter $\theta$.
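A quick simulation sketch (assuming the same Bernoulli model as above, with an arbitrarily chosen true parameter) can check both facts empirically: the sample mean of the per-observation scores at the true parameter is close to zero, and their sample variance is close to the Fisher information, which for a Bernoulli model is 1/(p(1-p)).

```python
import numpy as np

rng = np.random.default_rng(0)
p_true = 0.3                                 # true parameter used to generate the data
x = rng.binomial(1, p_true, size=200_000)    # simulated observations

# Per-observation scores, evaluated at the true parameter.
scores = x / p_true - (1 - x) / (1 - p_true)

print(scores.mean())                # close to 0: mean-zero property
print(scores.var())                 # close to the Fisher information ...
print(1 / (p_true * (1 - p_true)))  # ... which is 1/(p(1-p)) ~= 4.76 here
```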
Note: a zero gradient is not sufficient for global optimality; it can also occur at local maxima, local minima, saddle points, … unless the parameter space is convex and the log-likelihood function is concave, in which case any stationary point is a global maximum.
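For instance (an illustrative example not from the original text), the Cauchy location model with two well-separated observations has a non-concave log-likelihood: the score vanishes at three points, two local maxima near the observations and a local minimum between them. A rough numerical sketch:

```python
import numpy as np

# Two well-separated observations under a Cauchy location model (scale fixed at 1).
x = np.array([-3.0, 3.0])

def score(theta):
    """d/dtheta of sum_i log Cauchy(x_i; theta, 1),
    i.e. sum_i 2*(x_i - theta) / (1 + (x_i - theta)**2)."""
    return np.sum(2 * (x - theta) / (1 + (x - theta) ** 2))

grid = np.linspace(-6.0, 6.0, 2000)   # grid chosen so it does not hit 0 exactly
s = np.array([score(t) for t in grid])

# Sign changes of the score locate its approximate roots (stationary points).
roots = grid[:-1][np.sign(s[:-1]) != np.sign(s[1:])]
print(roots)   # roughly [-2.8, 0.0, 2.8]: local max, local min, local max
```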