Login
From:
Stack Overflow
(Uncensored)
subscribe
cuda - Does PTX (8.4) not cover smaller-shape WMMA instructions? - Stack Overflow
https://stackoverflow.com/questions/78146946/does-ptx-8-4-not-cover-smaller-shape-wmma-instructions
links
backlinks
Roast topics
Find topics
Find it!
I want to use a SASS instruction which (AFAICT) is not available via a PTX instruction as of CUDA 12.4. Namely, suppose it is: HMMA.16816.F16 - a warp-wide matrix-multiply-and-add, of half-precisio...