PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot #87121

copybara-service · 2025-02-12T09:15:15Z

PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot

Imported from GitHub PR openxla/xla#22512

Support NVFP4 in addition to MXFP8 hardware acceleration for the "__op$block_scaled_dot" custom call.

This PR also addresses some nits from the internal review (like renaming a generic CompositeType to a more specific CudnnMxType).
Copybara import of the project:

--
32e76a88b2107c079e26826417d22664cbf809a3 by Sergey Kozub [email protected]:

[XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot

Merging this change closes #22512

FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#22512 from openxla:skozub/block_scaling_nvfp4 32e76a88b2107c079e26826417d22664cbf809a3

Imported from GitHub PR openxla/xla#22512 Support NVFP4 in addition to MXFP8 hardware acceleration for the "__op$block_scaled_dot" custom call. This PR also addresses some nits from the internal review (like renaming a generic `CompositeType` to a more specific `CudnnMxType`). Copybara import of the project: -- 32e76a88b2107c079e26826417d22664cbf809a3 by Sergey Kozub <[email protected]>: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot Merging this change closes #22512 PiperOrigin-RevId: 725985050

copybara-service bot force-pushed the exported_pr_725943746 branch from 7fb33fe to 491cbfb Compare February 12, 2025 11:42

copybara-service bot closed this Feb 12, 2025

copybara-service bot merged commit 491cbfb into master Feb 12, 2025
2 checks passed

copybara-service bot deleted the exported_pr_725943746 branch February 12, 2025 11:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot #87121

PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot #87121

Uh oh!

copybara-service bot commented Feb 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot #87121

PR #22512: [XLA:GPU] Enable cuDNN kernel for NVFP4 block scaled dot #87121

Uh oh!

Conversation

copybara-service bot commented Feb 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant