From: Giles' Blog
Writing an LLM from scratch, part 12 -- multi-head attention :: Giles' blog
https://www.gilesthomas.com/2025/04/llm-from-scratch-12-multi-head-attention
Tagged with: python, ai, llm from scratch, til deep dives
Finally getting to the end of chapter 3 of Raschka’s LLM book! This time it’s multi-head attention: what it is, how it works, and why the code does what it does.