Topic: Writing an LLM from scratch, part 7 -- wrapping up non-trainable self-attention