[QNN EP] Enablement of 64bit Udma mode by qti-monumeen · Pull Request #26677 · microsoft/onnxruntime

qti-monumeen · 2025-11-28T09:59:23Z

Description

Enable 64-bit UDMA mode for device architecture v81 or above.

Motivation and Context

Support 64-bit UDMA mode so that models run efficiently on HTP targets v81 or above.

quic-tirupath · 2025-12-11T23:00:11Z

@edgchen1
Could you please review and trigger CI on this PR?

edgchen1 · 2025-12-12T00:18:39Z

onnxruntime/test/providers/qnn/qnn_basic_test.cc

  std::filesystem::remove_all(dump_dir);
 }

+// Test exended UDMA mode on supported hardware (should run successfully)


Suggested change

-// Test exended UDMA mode on supported hardware (should run successfully)
+// Test extended UDMA mode on supported hardware (should run successfully)

how do we know if we are on supported hardware?

edgchen1 · 2025-12-12T00:39:16Z

/azp run Linux QNN CI Pipeline,Windows ARM64 QNN CI Pipeline

azure-pipelines · 2025-12-12T00:39:27Z

Azure Pipelines successfully started running 2 pipeline(s).

yuslepukhin · 2025-12-13T01:47:28Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows OpenVINO CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-12-13T01:47:48Z

Azure Pipelines successfully started running 4 pipeline(s).

Copilot

Pull request overview

This PR enables 64-bit UDMA (User-space Direct Memory Access) mode for the QNN (Qualcomm Neural Network) execution provider on device architecture v81 and above. The feature is designed to improve performance on supported HTP (Hexagon Tensor Processor) hardware by enabling extended UDMA capabilities.

Key changes include:

- Addition of a new extended_udma provider option that accepts values "0" (disabled) or "1" (enabled), defaulting to disabled
- Integration of the extended UDMA configuration into the QNN context creation flow
- Updates to test infrastructure files to support the new option in command-line argument parsing
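
As a rough illustration of the option semantics described above ("0" or "1", disabled by default), the following C++ sketch parses the `extended_udma` key from a string-to-string options map. The `ProviderOptions` alias, the `ParseExtendedUdma` helper, and the choice to throw on an invalid value are assumptions made for this example; the PR text does not say how the QNN EP actually reports invalid values.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <unordered_map>

// Hypothetical alias for illustration; the real QNN EP has its own options type.
using ProviderOptions = std::unordered_map<std::string, std::string>;

// Returns true only when "extended_udma" is explicitly set to "1".
// A missing key means disabled. Any value other than "0" or "1" is rejected
// (throwing here is an assumption, not confirmed behavior of the EP).
inline bool ParseExtendedUdma(const ProviderOptions& options) {
  const auto it = options.find("extended_udma");
  if (it == options.end()) {
    return false;  // default: disabled
  }
  if (it->second == "1") {
    return true;
  }
  if (it->second == "0") {
    return false;
  }
  throw std::invalid_argument("extended_udma must be \"0\" or \"1\", got: " + it->second);
}

// Small usage example: the option is off unless explicitly enabled.
inline void ExampleUsage() {
  assert(!ParseExtendedUdma({}));
  assert(ParseExtendedUdma({{"extended_udma", "1"}}));
}
```

Keeping the default at disabled means existing sessions that never set the key behave exactly as before.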

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| onnxruntime/test/providers/qnn/qnn_basic_test.cc | Adds test case for extended UDMA mode functionality with htp_arch v81 |
| onnxruntime/test/perftest/ort_test_session.cc | Adds "extended_udma" to QNN provider options list and validation |
| onnxruntime/test/perftest/command_args_parser.cc | Adds documentation for the extended_udma option in help text |
| onnxruntime/test/onnx/main.cc | Adds "extended_udma" option validation and documentation |
| onnxruntime/test/ep_weight_sharing_ctx_gen/command_args_parser.cc | Adds "extended_udma" to context generation tool options |
| onnxruntime/core/providers/qnn/qnn_execution_provider.h | Adds member variable to store extended UDMA mode flag |
| onnxruntime/core/providers/qnn/qnn_execution_provider.cc | Parses extended_udma option and passes it to backend manager |
| onnxruntime/core/providers/qnn/builder/qnn_backend_manager.h | Updates method signatures to accept extended UDMA parameter |
| onnxruntime/core/providers/qnn/builder/qnn_backend_manager.cc | Implements extended UDMA configuration in QNN context creation |
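
The per-file summary describes a flag that is stored on the execution provider, handed to the backend manager, and turned into a context configuration entry. The C++ sketch below shows that general shape only. Every type and name in it (`ContextConfigEntry`, `BackendManagerSketch`, `ExecutionProviderSketch`) is a hypothetical stand-in, not an ONNX Runtime or QNN SDK type, and the real context config structures are not shown in this conversation.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for one context configuration entry.
struct ContextConfigEntry {
  std::string key;
  bool enabled;
};

// Hypothetical stand-in for the backend manager: it receives the flag and
// decides which entries accompany context creation.
class BackendManagerSketch {
 public:
  explicit BackendManagerSketch(bool extended_udma) : extended_udma_(extended_udma) {}

  // The extended UDMA entry is appended only when the flag was requested,
  // so the default configuration list is left unchanged.
  std::vector<ContextConfigEntry> BuildContextConfigs() const {
    std::vector<ContextConfigEntry> configs;
    if (extended_udma_) {
      configs.push_back({"extended_udma", true});
    }
    return configs;
  }

 private:
  bool extended_udma_;
};

// Hypothetical stand-in for the execution provider: it stores the parsed
// option and forwards it when constructing the backend manager.
class ExecutionProviderSketch {
 public:
  explicit ExecutionProviderSketch(bool extended_udma) : extended_udma_(extended_udma) {}

  BackendManagerSketch MakeBackendManager() const {
    return BackendManagerSketch(extended_udma_);
  }

 private:
  bool extended_udma_;
};

// Small usage example: with the flag off, no extra entry is produced.
inline void ExampleUsage() {
  const ExecutionProviderSketch ep(false);
  assert(ep.MakeBackendManager().BuildContextConfigs().empty());
}
```

Whether the actual implementation also checks the HTP architecture before adding the entry is not stated in this conversation, so the sketch leaves that out.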

yuslepukhin · 2025-12-16T18:40:47Z

Please, respond to the comments so the answers are documented.

yuslepukhin · 2026-01-05T22:53:12Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-01-05T22:53:30Z

Azure Pipelines successfully started running 4 pipeline(s).

yuslepukhin · 2026-01-15T22:13:51Z

This branch needs to be rebased (merged) from main.

tirupath-qti · 2026-01-28T01:03:00Z

@edgchen1 and @yuslepukhin
This PR missed 1.24, but we still want to merge it into mainline because it is required to enable the GPT-OSS model on QC platforms. It can be picked into any future point release.

Could you please review and trigger CI?

edgchen1 · 2026-01-28T01:46:18Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-01-28T01:46:37Z

Azure Pipelines successfully started running 4 pipeline(s).

tirupath-qti · 2026-01-29T19:14:03Z

@edgchen1
It seems that one CI job unrelated to the QNN EP failed. Could you please unblock this PR and approve it?
Note: this is needed to enable the GPT-OSS QDQ model on QC hardware.

This cherry-picks the following commits for the 1.24.2 release:

- #27096
- #27077
- #26677
- #27238
- #27213
- #27256
- #27278
- #27275
- #27276
- #27216
- #27271
- #27299
- #27294
- #27266
- #27176
- #27126
- #27252

---------

Co-authored-by: Xiaofei Han
Co-authored-by: Jiajia Qin
Co-authored-by: Yulong Wang
Co-authored-by: qti-monumeen
Co-authored-by: Ankit Maheshkar
Co-authored-by: Eric Crawford
Co-authored-by: Copilot
Co-authored-by: guschmue
Co-authored-by: Guenther Schmuelling
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: angelser
Co-authored-by: Angela Serrano Brummett
Co-authored-by: Misha Chornyi
Co-authored-by: hariharans29
Co-authored-by: eserscor
Co-authored-by: Copilot
Co-authored-by: Baiju Meswani
Co-authored-by: Adrian Lizarraga
Co-authored-by: Ti-Tai Wang
Co-authored-by: bmehta001

qti-monumeen marked this pull request as ready for review November 28, 2025 10:18

edgchen1 added the ep:QNN issues related to QNN execution provider label Dec 1, 2025

quic-tirupath approved these changes Dec 11, 2025

edgchen1 reviewed Dec 12, 2025

yuslepukhin requested a review from Copilot December 13, 2025 01:47

Copilot started reviewing on behalf of yuslepukhin December 13, 2025 01:48

Copilot AI reviewed Dec 13, 2025

edgchen1 previously approved these changes Jan 15, 2026

qti-monumeen added 3 commits January 19, 2026 11:00

[QNN EP] Enablement of 64bit Udma mode

5bcd96e

Support 64-bit UDMA mode to run models efficiently on HTP v81 or above. Implement a new QNN option `extended_udma` and propagate it through the context config to HTP.

Addressing comments

ce083ea

Addressing new comments

8550225

minfhong-qti dismissed edgchen1’s stale review via 8550225 January 19, 2026 03:31

minfhong-qti force-pushed the dev/qti-monumeen/64bit-udma-mode-enablement branch from 54d64dc to 8550225 January 19, 2026 03:31

Merge branch 'main' into dev/qti-monumeen/64bit-udma-mode-enablement

ebe496d

edgchen1 approved these changes Jan 30, 2026

edgchen1 enabled auto-merge (squash) January 30, 2026 01:08

edgchen1 merged commit 711d155 into microsoft:main Feb 3, 2026
119 of 126 checks passed

edgchen1 added the release:1.24.2 label Feb 5, 2026

tianleiwu mentioned this pull request Feb 12, 2026

ORT 1.24.2 release cherry pick round 1 #27330

Merged

tianleiwu removed the release:1.24.2 label Feb 12, 2026

7 participants