cma-es

The CMA Evolution Strategy: A Tutorial

go through this in ~december

Motivation

CMA-ES addresses a shortcoming of simple ES and GA variants: they use a fixed standard deviation for the sampling noise, so the amount of exploration never adapts.
Instead, we want the search to explore broadly at first and narrow down only once we are confident that a good optimum has been found.

CMA-ES

CMA-ES updates the mean of the search distribution towards the elite and also adapts the distribution itself: it is biased towards the direction of the elite, and it widens or narrows depending on whether the best solutions are far away or close by.

Naive covariance matrix adaptation

We could try to naively adapt the covariance matrix from the current population, computing it w.r.t. the mean of the elites instead of the mean of the entire population.
But that alone is not enough. The distribution gets stretched very thin in the direction of the elites, and its size shrinks quickly: if the distribution is a long ellipse, the new population is spread out along that axis too, and once it shrinks, the elites lie closer to the mean, so the distribution shrinks even more, and so on.
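
A minimal sketch of this naive scheme, to make the shrinking visible. The toy objective `sphere`, the helper `naive_step`, and the population size, elite fraction, and starting point are all assumptions chosen for illustration; none of them come from the tutorial. The covariance is refit around the mean of the elites, as described above, and the trace of the covariance is tracked as a rough measure of the distribution's size.

```python
import numpy as np


def sphere(x):
    """Toy objective (assumed for this sketch): squared distance to the origin."""
    return np.sum(x ** 2, axis=-1)


def naive_step(mean, cov, rng, pop_size=200, elite_frac=0.25):
    """One generation of the naive scheme: refit mean and covariance to the elites."""
    population = rng.multivariate_normal(mean, cov, size=pop_size)
    fitness = sphere(population)
    n_elite = int(pop_size * elite_frac)
    elites = population[np.argsort(fitness)[:n_elite]]  # lower fitness is better
    elite_mean = elites.mean(axis=0)
    centered = elites - elite_mean  # deviations w.r.t. the mean of the elites
    new_cov = centered.T @ centered / n_elite
    return elite_mean, new_cov


rng = np.random.default_rng(0)
mean, cov = np.array([3.0, 3.0]), np.eye(2)
traces = [np.trace(cov)]  # trace = sum of variances, a rough "size" of the distribution
for _ in range(5):
    mean, cov = naive_step(mean, cov, rng)
    traces.append(np.trace(cov))

print("trace of covariance per generation:", traces)
print("mean after 5 generations:", mean)
```

Printing the traces shows how quickly the spread collapses when nothing counteracts the shrinkage.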

$C = B D^{2} B^{T}$

$$
\begin{aligned}
N(m, C) &\sim m + N(0, C) \\
&\sim m + C^{1/2} N(0, I) \\
&\sim m + B D B^{T} N(0, I) \\
&\sim m + B D N(0, I)
\end{aligned}
$$

Line 1: affine property of the Gaussian (adding a constant only shifts the mean).
Line 2: $C^{1/2}$ is the matrix square root of $C$, so $C^{1/2} N(0, I) \sim N(0, C)$.
Line 3: eigendecomposition of $C$: since $C = B D^{2} B^{T}$, we have $C^{1/2} = B D B^{T}$.
Line 4: $B^{T} N(0, I) \sim N(0, I)$, since $B$ is an orthogonal matrix and rotations of a standard normal distribution are still standard normal (rotating a sphere).
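
The last line of the derivation can be sketched in NumPy as follows. The helper name `sample_via_eigendecomposition` and the example values of `m` and `C` are made up for illustration; `np.linalg.eigh` is used on the assumption that $C$ is symmetric, which holds for any covariance matrix.

```python
import numpy as np


def sample_via_eigendecomposition(m, C, n_samples, rng):
    """Draw samples from N(m, C) as m + B D z with z ~ N(0, I)."""
    # eigh handles symmetric matrices and returns orthonormal eigenvectors (columns of B)
    eigvals, B = np.linalg.eigh(C)
    D = np.sqrt(eigvals)  # standard deviations along the principal axes
    z = rng.standard_normal((n_samples, len(m)))  # each row is one draw from N(0, I)
    # Column form x = m + B (D z); in row form this is (z * D) @ B.T
    samples = m + (z * D) @ B.T
    return samples, B, D


rng = np.random.default_rng(0)
m = np.array([1.0, -2.0])
C = np.array([[4.0, 1.5],
              [1.5, 1.0]])
samples, B, D = sample_via_eigendecomposition(m, C, 200_000, rng)

print("empirical mean:", samples.mean(axis=0))
print("empirical covariance:\n", np.cov(samples, rowvar=False))
```

With enough samples, the empirical mean and covariance should land close to `m` and `C`, which is a quick sanity check that scaling by $D$ and rotating by $B$ is all that is needed.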

Since the covariance calculation scales with $O(N^{2})$, CMA-ES becomes impractical for more than ~10k parameters. Variants with a reduced covariance model (low-rank or diagonal), for example LM-MA-ES or sep-CMA-ES, scale to larger problems.
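
To get a feel for the quadratic scaling, here is a back-of-the-envelope calculation of the memory needed just to store the covariance matrix. It assumes 8-byte floats and ignores everything else the algorithm keeps around; the function names are made up for this sketch.

```python
def full_covariance_bytes(n_params, bytes_per_entry=8):
    """Memory for a dense n x n covariance matrix (assuming float64 entries)."""
    return n_params * n_params * bytes_per_entry


def diagonal_covariance_bytes(n_params, bytes_per_entry=8):
    """Memory when only the diagonal is stored, as in a diagonal covariance model."""
    return n_params * bytes_per_entry


for n in (1_000, 10_000, 100_000):
    full_gb = full_covariance_bytes(n) / 1e9
    diag_mb = diagonal_covariance_bytes(n) / 1e6
    print(f"N={n}: full matrix {full_gb} GB, diagonal only {diag_mb} MB")
```

At 10k parameters the dense matrix already takes about 0.8 GB, and every further factor of 10 in parameters multiplies that by 100.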

covariance matrix evolution strategies

CMA-ES YouTube series + blog:
https://szhaovas.github.io/2023-02-06-cmaesall/
https://szhaovas.github.io/2022-09-06-cmaes/
https://szhaovas.github.io/2022-09-07-cmaes2/
https://szhaovas.github.io/2022-09-09-cmaes3/

https://inria.hal.science/hal-00808450v1/document

Max Wolf's Second Brain
