[CUDA] Fix build for sm<53 by tianleiwu · Pull Request #24582 · microsoft/onnxruntime

tianleiwu · 2025-04-28T20:29:50Z

Description

There is some build error for --cmake_extra_defines CMAKE_CUDA_ARCHITECTURES=52.

Some half2 function like __hfma2 used in MatMul 8 bits is not defined for sm < 53. Add an implementation that does not use half2 for those old GPUs.

Fix another build error using cuda 12.5 that is caused by extra const in MOE code for sm<53.

Motivation and Context

Fix nuget packaging pipeline, which uses CMAKE_CUDA_ARCHITECTURES=52-real;61-real;75-real;86-real;89-real;90-virtual.

### Description There is some build error for `--cmake_extra_defines CMAKE_CUDA_ARCHITECTURES=52`. Some half2 function like `__hfma2` used in MatMul 8 bits is not defined for sm < 53. Add an implementation that does not use half2 for those old GPUs. Fix another build error using cuda 12.5 that is caused by extra `const` in MOE code for sm<53. ### Motivation and Context Fix nuget packaging pipeline, which uses `CMAKE_CUDA_ARCHITECTURES=52-real;61-real;75-real;86-real;89-real;90-virtual`.

### Description Cherry pick the following into [rel-1.22.0](https://github.com/microsoft/onnxruntime/tree/rel-1.22.0) - (#24491) - (#24509) - (#24564) - (#24574) - (#24582) - (#24584) - (#24568) - (#24587) - (#24563) - (#24592) - (#24526) - (#24552) - (#24588) - (#24605) - (#24606) --------- Co-authored-by: Jing Fang <[email protected]> Co-authored-by: Tianlei Wu <[email protected]> Co-authored-by: Baiju Meswani <[email protected]> Co-authored-by: Scott McKay <[email protected]> Co-authored-by: Mark Schofield <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Edward Chen <[email protected]> Co-authored-by: Ashwath Shankarnarayan <[email protected]> Co-authored-by: saurabh <[email protected]> Co-authored-by: Adrian Lizarraga <[email protected]> Co-authored-by: Hector Li <[email protected]>

### Description There is some build error for `--cmake_extra_defines CMAKE_CUDA_ARCHITECTURES=52`. Some half2 function like `__hfma2` used in MatMul 8 bits is not defined for sm < 53. Add an implementation that does not use half2 for those old GPUs. Fix another build error using cuda 12.5 that is caused by extra `const` in MOE code for sm<53. ### Motivation and Context Fix nuget packaging pipeline, which uses `CMAKE_CUDA_ARCHITECTURES=52-real;61-real;75-real;86-real;89-real;90-virtual`.

snnn · 2025-09-05T20:48:53Z

This PR has been included in the rel-1.22.0 branch. Removing the release:1.22.0 label.

fix build for sm=52

ae9c380

tianleiwu added the release:1.22.0 label Apr 28, 2025

tianleiwu requested review from baijumeswani, jiafatom and kunal-vaishnavi April 28, 2025 20:34

jiafatom approved these changes Apr 28, 2025

View reviewed changes

kunal-vaishnavi approved these changes Apr 28, 2025

View reviewed changes

snnn approved these changes Apr 28, 2025

View reviewed changes

baijumeswani approved these changes Apr 28, 2025

View reviewed changes

snnn merged commit 76cee36 into main Apr 29, 2025
80 of 88 checks passed

snnn deleted the tlwu/fix_matmul_8bits_old_gpu branch April 29, 2025 02:16

vraspar mentioned this pull request May 1, 2025

Cherry-picks into rel-1.22.0 #24611

Merged

snnn removed the release:1.22.0 label Sep 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA] Fix build for sm<53#24582

[CUDA] Fix build for sm<53#24582
snnn merged 1 commit intomainfrom
tlwu/fix_matmul_8bits_old_gpu

tianleiwu commented Apr 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

snnn commented Sep 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

tianleiwu commented Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Uh oh!

snnn commented Sep 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

tianleiwu commented Apr 28, 2025 •

edited

Loading