Offline learning distinguishes between a training phase and a deployment phase. During training, the agent is fit with mini-batch gradient descent on a static, pre-collected dataset.
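As a rough illustration of the two phases, the sketch below fits a one-dimensional linear model with mini-batch gradient descent on a fixed dataset and then uses the frozen parameters for prediction. The model, the squared-error loss, the synthetic dataset, and the names `train_offline` and `deploy` are all assumptions made for this example; the text does not specify them.

```python
import random


def train_offline(dataset, epochs=200, batch_size=4, lr=0.05, seed=0):
    """Training phase: fit y = w*x + b by mini-batch gradient descent.

    `dataset` is a static list of (x, y) pairs. No new samples are
    collected while training runs.
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    data = list(dataset)  # local copy so shuffling leaves the input untouched
    for _ in range(epochs):
        rng.shuffle(data)  # new mini-batch partition each epoch
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]
            n = len(batch)
            # Gradients of the mean squared error over the mini-batch.
            grad_w = sum(2 * (w * x + b - y) * x for x, y in batch) / n
            grad_b = sum(2 * (w * x + b - y) for x, y in batch) / n
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b


def deploy(params):
    """Deployment phase: parameters are frozen and only used for prediction."""
    w, b = params
    return lambda x: w * x + b


# Hypothetical pre-collected dataset generated from y = 3x + 1.
dataset = [(k / 10, 3.0 * (k / 10) + 1.0) for k in range(-10, 11)]
params = train_offline(dataset)
predict = deploy(params)
```

Here `predict` never updates `params`, which reflects the separation between the two phases: any further learning would require returning to the training phase with a new or extended dataset.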