Fix OpenMP usage from MlasSgemm by tracysh · Pull Request #68 · microsoft/onnxruntime

tracysh · 2018-11-30T21:48:11Z

Fix the OpenMP usage inside MlasSgemm to use "#pragma omp parallel for" instead of "#pragma omp parallel num_threads()". MlasSgemm computes the number of threads that can usefully do work for the GEMM, and I had thought the num_threads syntax was the cleanest way to schedule that work. However, whenever the requested number of threads changes, OpenMP runtimes painfully re-create the thread pool. Using "#pragma omp parallel for" keeps the existing thread pool around and hopefully avoids waking threads that won't have any work to do.
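For readers unfamiliar with the two forms, here is a minimal sketch of the pattern described above. It is not the MLAS code: `ComputeTargetPartitions`, `SgemmRows`, `ParallelSgemm`, and the "one partition per 64 rows" heuristic are hypothetical names and values chosen only for illustration. The idea is to compute how many partitions are worth running, then loop over partition indices under "#pragma omp parallel for" so the runtime distributes them across the threads it already has, rather than requesting a specific team size with num_threads.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical heuristic (not the MLAS one): one partition per 64 rows of the
// output, capped at max_partitions, and never fewer than one.
int ComputeTargetPartitions(std::size_t M, int max_partitions) {
    const int by_rows = static_cast<int>((M + 63) / 64);
    return std::max(1, std::min(by_rows, max_partitions));
}

// Naive C = A * B restricted to rows [row_begin, row_end).
// A is M x K, B is K x N, C is M x N, all row-major.
void SgemmRows(const float* A, const float* B, float* C,
               std::size_t row_begin, std::size_t row_end,
               std::size_t K, std::size_t N) {
    for (std::size_t i = row_begin; i < row_end; ++i) {
        for (std::size_t j = 0; j < N; ++j) {
            float sum = 0.0f;
            for (std::size_t k = 0; k < K; ++k) {
                sum += A[i * K + k] * B[k * N + j];
            }
            C[i * N + j] = sum;
        }
    }
}

void ParallelSgemm(const float* A, const float* B, float* C,
                   std::size_t M, std::size_t K, std::size_t N,
                   int max_partitions) {
    const int partitions = ComputeTargetPartitions(M, max_partitions);
    const std::size_t count = static_cast<std::size_t>(partitions);
    const std::size_t rows_per_partition = (M + count - 1) / count;

    // Pattern being replaced (shown only as a comment):
    //   #pragma omp parallel num_threads(partitions)
    //   { int tid = omp_get_thread_num(); /* work on slice tid */ }
    //
    // Pattern used instead: iterate over partition indices and let the
    // OpenMP runtime assign iterations to its existing threads. Without
    // OpenMP enabled, the pragma is ignored and the loop runs serially.
#pragma omp parallel for
    for (int tid = 0; tid < partitions; ++tid) {
        const std::size_t begin =
            std::min(M, static_cast<std::size_t>(tid) * rows_per_partition);
        const std::size_t end = std::min(M, begin + rows_per_partition);
        SgemmRows(A, B, C, begin, end, K, N);
    }
}
```

Because each partition writes a disjoint range of rows of C, the loop iterations are independent and need no synchronization. Build with OpenMP enabled (for example `-fopenmp` on GCC or Clang) to run the loop in parallel.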

* Stop passing --qairt-root in CI runs

Some PRs that use core/common/inlined_containers.h can cause failures in the CUDA CI pipeline.

```
E:\_work\_temp\build\RelWithDebInfo\vcpkg_installed\x64-windows-static-md\include\absl/hash/internal/hash.h(481): error #68-D: integer conversion resulted in a change of sign [E:\_work\_temp\build\RelWithDebInfo\onnxruntime_providers_cuda.vcxproj]
    sizeof(T) == -1,
    ^

Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"

E:\_work\_temp\build\RelWithDebInfo\vcpkg_installed\x64-windows-static-md\include\absl/hash/hash.h(337): error #549-D: variable "s" is used before its value is set [E:\_work\_temp\build\RelWithDebInfo\onnxruntime_providers_cuda.vcxproj]
    return s;
    ^

E:\_work\_temp\build\RelWithDebInfo\vcpkg_installed\x64-windows-static-md\include\absl/container/internal/raw_hash_set.h(468): error #69-D: integer conversion resulted in truncation [E:\_work\_temp\build\RelWithDebInfo\onnxruntime_providers_cuda.vcxproj]
    static_cast<uint16_t>(reinterpret_cast<uintptr_t>(&seed));
    ^

3 errors detected in the compilation of "E:/_work/onnxruntime/onnxruntime/onnxruntime/contrib_ops/cuda/sparse/block_mask.cu".
```

This change adds a patch to Abseil to mitigate those failures. This solution has been verified to be effective in PR #27087.

use #pragma parallel for properly

220ee40

tracysh requested a review from a team November 30, 2018 21:48

duli2012 approved these changes Nov 30, 2018

tracysh merged commit 2104484 into master Nov 30, 2018

tracysh deleted the tracysh/fixopenmpgemm branch November 30, 2018 22:11

AliceSum mentioned this pull request Mar 27, 2022

[Lot of people have this issue] The model can not "really" do inference; Loading issue is solved. #10990

Closed

lanyuer mentioned this pull request May 16, 2022

crashed in construction of Ort::Env, xcode13, iPhoneX #11446

Closed

hui-li-xf mentioned this pull request Jun 10, 2025

[Mobile] null pointer dereference: SIGSEGV #24991

Closed

quic-ankus pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Nov 25, 2025

Enable use of LKG QAIRT (microsoft#68)

1b21915

* Stop passing --qairt-root in CI runs

