Max Wolf's Second Brain

decision transformer

Apr 24, 2026 · 1 min read

https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/#transformers-for-reinforcement-learning

The experiments compared Decision Transformer (DT) with several model-free RL baselines and showed that:

  • DT is more efficient than behavior cloning in the low-data regime;
  • DT can model the distribution of returns very well;
  • A long context is crucial for obtaining good results;
  • DT can work with sparse rewards.
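
To make the points about returns, context length, and sparse rewards concrete, here is a minimal sketch of the return-to-go conditioning that DT is built on. It is not taken from the linked post: the helper names `returns_to_go` and `build_context`, the toy episode, and the string placeholders for states and actions are all assumptions made for illustration, and no model is involved.

```python
def returns_to_go(rewards):
    """Suffix sums of rewards: rtg[t] = r[t] + r[t+1] + ... + r[T-1]."""
    rtg = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running += rewards[t]
        rtg[t] = running
    return rtg


def build_context(rtg, states, actions, t, context_len):
    """Interleave the last `context_len` (return-to-go, state, action)
    triples ending at step t into one flat token sequence."""
    start = max(0, t - context_len + 1)
    tokens = []
    for i in range(start, t + 1):
        tokens.extend([("rtg", rtg[i]), ("state", states[i]), ("action", actions[i])])
    return tokens


# Hypothetical sparse-reward episode: the only reward arrives at the last step.
rewards = [0.0, 0.0, 0.0, 1.0]
states = ["s0", "s1", "s2", "s3"]
actions = ["a0", "a1", "a2", "a3"]

rtg = returns_to_go(rewards)
context = build_context(rtg, states, actions, t=3, context_len=2)
```

Even though the reward is sparse, every timestep carries a non-zero return-to-go, which gives the sequence model a conditioning signal at each step. The `context_len` parameter controls how many past triples the model would see, which is the knob the long-context observation refers to.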

off-policy
transformers in RL

