Login
From:
Giles' Blog
(Uncensored)
subscribe
Writing an LLM from scratch, part 12 -- multi-head attention :: Giles' blog
https://www.gilesthomas.com/2025/04/llm-from-scratch-12-multi-head-attention
links
backlinks
Tagged with:
python
ai
llm from scratch
til deep dives
Finally getting to the end of chapter 3 of Raschka’s LLM book! This time it’s multi-head attention: what it is, how it works, and why the code does what it does.
Roast topics
Find topics
Find it!