follow the greedy strategy (the one which is projected to be the best) with probability and explore with probablity .
1 min read
follow the greedy strategy (the one which is projected to be the best) with probability 1−ϵ and explore with probablity ϵ.