Max Wolf's Second Brain

Home

❯

general

❯

mechanistic interpretability

mechanistic interpretability

Jan 11, 20261 min read

interpretability

A Comprehensive Mechanistic Interpretability Explainer & Glossary


Graph View

Backlinks

  • Anthropic Interpretability - Understanding how AI models think
  • Toy Models of Superposition
  • mixed selectivity

Created with Quartz v4.5.1 © 2026

  • GitHub
  • Discord Community