fix: setting builder optimization level to TRT 8.6 default by gedoensmax · Pull Request #15897 · microsoft/onnxruntime

gedoensmax · 2023-05-10T19:23:32Z

The actual released default level is 3 and not the previously used 2.

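Just a small sample of the effects:

![Screenshot 2023-05-10 at 15 49 55](https://github.com/microsoft/onnxruntime/assets/44298237/5a694446-22c0-4943-9ddf-80670781878f)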

chilo-ms · 2023-05-10T20:00:11Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

chilo-ms · 2023-05-10T20:00:23Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2023-05-10T20:02:01Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-10T20:02:09Z

Azure Pipelines successfully started running 9 pipeline(s).

jywu-msft · 2023-05-10T21:31:55Z

/azp run Linux QNN CI Pipeline, Windows ARM64 QNN CI Pipeline

azure-pipelines · 2023-05-10T21:32:06Z

Azure Pipelines successfully started running 2 pipeline(s).

chilo-ms · 2023-05-11T04:32:58Z

Could you merge main into your branch so that the "Linux GPU TensorRT CI Pipeline" passes?
Some PRs related to the ONNX 1.14 release were merged to main right after your PR was opened.

jywu-msft · 2023-05-11T14:49:41Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

jywu-msft · 2023-05-11T14:49:56Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

jywu-msft · 2023-05-11T14:50:08Z

/azp run Linux QNN CI Pipeline, Windows ARM64 QNN CI Pipeline

azure-pipelines · 2023-05-11T14:50:18Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2023-05-11T14:50:18Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-11T14:50:19Z

Azure Pipelines successfully started running 2 pipeline(s).

chilo-ms · 2023-05-11T17:43:50Z

onnxruntime/core/providers/tensorrt/tensorrt_execution_provider.cc

Lines 2831 and 2892 need to be modified as well. Both still compare against the old default of 2:

if (trt_state->builder_optimization_level != 2) {

jywu-msft · 2023-05-11

We'll have to kick off the CI again after those lines are updated.

jywu-msft · 2023-05-11

We really need to simplify option updates. The current process is too error-prone because one change has to be made in too many places in the code.

gedoensmax · 2023-05-12

Done! Is this something your team is working on right now?

jywu-msft · 2023-05-12

It's on our radar, and we would like to dedicate some time and resources to it.

minor change needed
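The review thread above notes that the default value had to be changed in several places, including two hard-coded comparisons. As a minimal sketch of one way to avoid that, the default could live in a single named constant that both the options struct and the comparison use. All names here (`sketch`, `kDefaultBuilderOptimizationLevel`, `TrtOptions`, `UsesNonDefaultLevel`, `DescribeLevel`) are hypothetical and are not taken from the ONNX Runtime code base; only the value 3 comes from this PR.

```cpp
#include <cassert>
#include <string>

// Hypothetical illustration only; these names do not exist in ONNX Runtime.
namespace sketch {

// Single source of truth for the default level.
// 3 is the TensorRT 8.6 default according to this PR.
constexpr int kDefaultBuilderOptimizationLevel = 3;

// Simplified stand-in for the execution provider's option state.
struct TrtOptions {
  int builder_optimization_level = kDefaultBuilderOptimizationLevel;
};

// Replaces hard-coded checks such as `level != 2` with a comparison
// against the shared constant, so a new default needs one edit.
inline bool UsesNonDefaultLevel(const TrtOptions& options) {
  return options.builder_optimization_level != kDefaultBuilderOptimizationLevel;
}

// Small helper that reports which case applies.
inline std::string DescribeLevel(const TrtOptions& options) {
  if (UsesNonDefaultLevel(options)) {
    return "override: " + std::to_string(options.builder_optimization_level);
  }
  return "default: " + std::to_string(kDefaultBuilderOptimizationLevel);
}

// Sanity check that the struct default and the constant agree.
inline void CheckDefaults() {
  TrtOptions defaults;
  assert(!UsesNonDefaultLevel(defaults));
}

}  // namespace sketch
```

With this layout, a default-constructed `sketch::TrtOptions` is never reported as an override, and changing the default again would only require editing the constant.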

jywu-msft · 2023-05-12T14:50:38Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Windows CPU CI Pipeline, Windows GPU CI Pipeline

jywu-msft · 2023-05-12T14:50:52Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

jywu-msft · 2023-05-12T14:51:13Z

/azp run Linux QNN CI Pipeline, Windows ARM64 QNN CI Pipeline

azure-pipelines · 2023-05-12T14:51:14Z

Azure Pipelines successfully started running 5 pipeline(s).

azure-pipelines · 2023-05-12T14:51:19Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2023-05-12T14:51:24Z

Azure Pipelines successfully started running 2 pipeline(s).


### Description

Cherry-picks 26 commits to the release branch. Most cherry-picks are clean merges, except:

1. When I got conflicts in cgmanifest.json and download-deps.yml, I chose to ignore the conflicts and regenerate the two files.
2. There were some conflicts in cmake/deps.txt and onnxruntime_c_api.cc.

PR list:

- [js/webgpu] fix Transpose with non-float tensor (#15819)
- [js/web] fix terser reserved symbols for worker (#15864)
- [JSEP] fix constructor for OrtDevice (#15805)
- Bump engine.io from 6.4.1 to 6.4.2 in /js/web (#15799)
- Bump engine.io from 6.4.0 to 6.4.2 in /onnxruntime/test/wasm (#15798)
- [wasm] revert emsdk to v3.1.19 (#15793)
- [wasm/JSEP] add threaded build to artifacts (#15777)
- [js/web] add target ort.webgpu.min.js (#15780)
- update ort extensions to 94142d8391c9791ec71c38336436319a2d4ac7a0 (#15688)
- fix: setting builder optimization level to TRT 8.6 default (#15897)
- Adust GetVersionString() GetBuildInfoString() signatures and move them to OrtApi (#15921)
- Fix segfault for multiple GPU run (regression) (#15823)
- android package fix (#15999)
- [CoreML EP] Minor changes to allow CoreML EP to handle more nodes and models. (#15993)
- Adding support for conv fp16 fusion on Resnet50v1 (#15474)
- update onnx release 1.14 for docker files (#15680)
- Avoid generating training documentation during packaging (#15795)
- Update Conv-Add-Relu Fusion Transformation (#15834)
- Fix symbolic shape infer empty value_info (#15842)
- NhwcFusedConv: Add before Activation (#15837)
- use __hmul2 instead of __hmul2_rn (#15852)
- change the EP device to default OrtDevice() for memoryType equals CPU Input (#15903)
- Fixing NhwcFusedConv fp16 (#15950)
- fix topo sort in quantization tool (#16003)
- [doc] add LeakyRelu to coreml supported ops (#15944)
- [DML EP] Add frequent upload heap flushing (#15960)

Co-authored-by: Yulong Wang
Co-authored-by: dependabot[bot]
Co-authored-by: Guenther Schmuelling
Co-authored-by: Shalva Mist
Co-authored-by: Maximilian Müller
Co-authored-by: Dmitri Smirnov
Co-authored-by: pengwa
Co-authored-by: Ashwini Khade
Co-authored-by: Edward Chen
Co-authored-by: Jian Chen
Co-authored-by: liqun Fu
Co-authored-by: Baiju Meswani
Co-authored-by: Tianlei Wu
Co-authored-by: Chen Fu
Co-authored-by: Ye Wang
Co-authored-by: cao lei
Co-authored-by: Yufeng Li
Co-authored-by: Rachel Guo
Co-authored-by: Patrice Vignola


fix: setting builder optimization level to correct TRT 8.6 default

5597450

This was referenced May 11, 2023

Update TRT EP doc #15906

Closed

Update TRT EP doc #15907

Merged

Merge branch 'microsoft:main' into trt_opt_lvl_defaukt

6bc803c

jywu-msft added the release:1.15 label May 11, 2023

jywu-msft requested a review from chilo-ms May 11, 2023 14:50

chilo-ms reviewed May 11, 2023


jywu-msft previously approved these changes May 11, 2023


fix missing changes

59dd499

chilo-ms approved these changes May 12, 2023


chilo-ms merged commit 1435510 into microsoft:main May 12, 2023

snnn added the triage:approved (Approved for cherrypicks for release) label May 18, 2023

snnn removed the triage:approved (Approved for cherrypicks for release) and release:1.15 labels May 19, 2023
