Computer Science

Transformers, intuitively

Attention, residual streams, and the architecture that took over AI — with the pictures that finally make it click.

7 lessons~110 min totalFeynman
What you'll learn
  • Explain a transformer block end-to-end without hand-waving
  • Read attention as soft lookup with real intuition
  • See why this shape generalized so much further than anyone expected
Progress0 / 7
Track complete ✓
Lessons
Related tracks