follow the greedy strategy (the one which is projected to be the best) with probability and explore with probablity .