The negative log likelihood loss (aka surprise, aka categorical cross-entropy loss):
where is the predicted probability of the true class label , for the example.
This is a simpler version of cross-entropy loss, also called categorical cross-entropy.