Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Login
From:
Franz Louis Cesista
(Uncensored)
subscribe
Flash Attention Minimal
https://leloykun.github.io/personal-projects/flash-attention-minimal/
links
backlinks
A minimal implementation of Flash Attention 1 & 2 in just ~350 lines of CUDA code.