Max Wolf's Second Brain

Home

❯

general

❯

Energy Based Transformers are Scalable Learners and Thinkers

Energy-Based Transformers are Scalable Learners and Thinkers

Feb 10, 20261 min read

year: 2025
paper: https://arxiv.org/pdf/2507.02092
website: https://alexiglad.github.io/blog/2025/ebt/
code: https://github.com/alexiglad/ebt
connections: energy-based models, transformer, test-time compute, system-2 thinking, diffusion models, generation-verification


Read the paper in detail + take notes

Refocus on this once it gets some traction / validation on a larger scale… or if I just happen have time for it / the other building blocks of soup are better fleshed out.


Graph View

Backlinks

  • energy-based models

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community