Login
From:
shreyansh26.github.io
(Uncensored)
subscribe
Paper Summary #8 - FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | Shreyansh Singh
https://shreyansh26.github.io/post/2023-03-26_flash-attention/
links
backlinks
Roast topics
Find topics
Find it!
Understanding FlashAttention which is the most efficient exact attention implementation out there, which optimizes for both memory requirements and wall-clock time.