
Conversation

@gedoensmax (Contributor) commented Aug 5, 2025

This PR currently holds two major improvements:

  • dynamic shape models should have much lower memory usage, and memory management is moved towards ORT allocators
  • the per-inference overhead for shape binding and address updates is reduced (a minimal usage sketch follows below)
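
Below is a minimal, illustrative sketch (not part of the PR) of the scenario these changes target: a dynamic-shape model run through ONNX Runtime with I/O binding, where input shapes and buffer addresses change between inferences. The provider string, model file, and tensor names are assumptions for illustration only.

```python
# Minimal sketch, not from the PR: exercising a dynamic-shape model with
# I/O binding so shapes and buffer addresses are re-bound between runs.
# The EP name "NvTensorRTRTXExecutionProvider", the model file, and the
# tensor names "input"/"output" are assumptions for illustration.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession(
    "dynamic_batch_model.onnx",
    providers=["NvTensorRTRTXExecutionProvider"],
)

io_binding = sess.io_binding()
for batch in (1, 4, 8):  # the batch dimension changes every run
    x = np.random.rand(batch, 3, 224, 224).astype(np.float32)
    io_binding.bind_cpu_input("input", x)  # re-bind shape and address
    io_binding.bind_output("output")       # let ORT allocate the output
    sess.run_with_iobinding(io_binding)
    (out,) = io_binding.copy_outputs_to_cpu()
```

If the description holds, the shape re-binding and address updates in the loop above should incur lower per-inference overhead, and the scratch memory needed for the varying shapes is managed through ORT allocators rather than by the EP itself.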

gedoensmax and others added 5 commits August 1, 2025 13:42
@chilo-ms (Contributor) commented Aug 5, 2025

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 5 pipeline(s).

@chilo-ms chilo-ms added ep:NvRTX NV RTX execution provider release:1.23.0 labels Aug 5, 2025
@skottmckay (Contributor) left a comment

:shipit:

@chilo-ms chilo-ms merged commit d912167 into microsoft:main Aug 6, 2025
91 of 92 checks passed
adrianlizarraga pushed a commit that referenced this pull request Aug 8, 2025
This currently holds two major improvements:
- dynamic shape models should have much lower memory usage, and memory management is moved towards ORT allocators
- the per-inference overhead for shape binding and address updates is reduced

---------

Co-authored-by: Gaurav Garg <[email protected]>
adrianlizarraga added a commit that referenced this pull request Aug 8, 2025
…5, 25652 (#25701)

### Description
Cherry-pick the following PRs into the `rel-1.23.0` branch:

- #25391
- #25611
- #25656
- #25346
- #25374
- #25664
- #25675
- #25652



---------

Co-authored-by: Yulong Wang <[email protected]>
Co-authored-by: Ishwar Raut <[email protected]>
Co-authored-by: Maximilian Müller <[email protected]>
Co-authored-by: Gaurav Garg <[email protected]>
Co-authored-by: Scott McKay <[email protected]>
Co-authored-by: Chi Lo <[email protected]>
Co-authored-by: Abhishek Jindal <[email protected]>
Co-authored-by: Dmitri Smirnov <[email protected]>
sanketkaleoss pushed a commit to sanketkaleoss/onnxruntime that referenced this pull request Aug 11, 2025
This currently holds two major improvements:
- dynamic shape models should have much lower memory usage, and memory management is moved towards ORT allocators
- the per-inference overhead for shape binding and address updates is reduced

---------

Co-authored-by: Gaurav Garg <[email protected]>