Login
From:
Franz Louis Cesista
(Uncensored)
subscribe
Flash Attention Minimal
https://leloykun.github.io/personal-projects/flash-attention-minimal/
links
backlinks
Roast topics
Find topics
Find it!
A minimal implementation of Flash Attention 1 & 2 in just ~350 lines of CUDA code.