Closed
Labels
module: build (Build system issues)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Description
🐛 Describe the bug
As far as I can tell, we never use the CMakeLists.txt in third_party/cutlass, which means we never enable the special MMA shape kernels in CUTLASS that increase performance on H100.
We should make sure to compile all of these shape kernels: it's a one-time cost, and it can help any CUTLASS matmuls used by other ops such as SDPA and Flash Attention.
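A rough way to check the first claim is to scan the checkout for any `add_subdirectory(...)` call that points at third_party/cutlass. The sketch below is only an illustration: the helper name `find_cutlass_subdirectory_calls` and the regex are my own assumptions, not part of the PyTorch build tooling. It only looks at single-line calls, so a call split across lines or built from a CMake variable would be missed.

```python
import re
from pathlib import Path

# Hypothetical helper for illustration; not part of PyTorch's build tooling.
# Matches a single-line add_subdirectory(...) whose arguments mention third_party/cutlass.
_PATTERN = re.compile(
    r"add_subdirectory\s*\([^)]*third_party/cutlass\b[^)]*\)", re.IGNORECASE
)


def find_cutlass_subdirectory_calls(repo_root):
    """Return (file, line_number, line) for each add_subdirectory call on third_party/cutlass."""
    root = Path(repo_root)
    cutlass_dir = root / "third_party" / "cutlass"
    candidates = list(root.rglob("CMakeLists.txt")) + list(root.rglob("*.cmake"))
    hits = []
    for path in sorted(candidates):
        # Skip CUTLASS's own build files; we only care about who includes it.
        if cutlass_dir in path.parents:
            continue
        text = path.read_text(errors="ignore")
        for lineno, line in enumerate(text.splitlines(), start=1):
            stripped = line.strip()
            if stripped.startswith("#"):
                continue  # ignore commented-out calls
            if _PATTERN.search(stripped):
                hits.append((str(path), lineno, stripped))
    return hits


if __name__ == "__main__":
    # Run from the root of a checkout; an empty result is consistent with the report above.
    for file, lineno, line in find_cutlass_subdirectory_calls("."):
        print(f"{file}:{lineno}: {line}")
```

An empty result does not prove the kernels are never built (they could be compiled through another path), but it is a quick first signal.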
Versions
master as of 8/16/2024