Login
From:
Colfax Research
(Uncensored)
subscribe
CUTLASS Tutorial: Efficient GEMM kernel designs with Pipelining – Colfax Research
https://research.colfax-intl.com/cutlass-tutorial-design-of-a-gemm-kernel/
links
backlinks
Roast topics
Find topics
Find it!
Welcome to Part 2 of our tutorial series on GEMM (GEneral Matrix Multiplication). In Part 1, we discussed the computational side of GEMM by going over WGMMA, which is the primitive instruction to m…