Skip to content

Conversation

adrianlizarraga and others added 7 commits February 3, 2025 15:20
### Description
The Web CI pipeline uses three different Windows machine pools:
1. onnxruntime-Win2022-webgpu-A10
2. onnxruntime-Win2022-VS2022-webgpu-A10
3. onnxruntime-Win-CPU-2022-web

This PR merges them together to reduce ongoing maintenance cost.
### Description
* Fix `NumericLimits<float>` that used infinity as max, which is not
consistent with `std::numeric_limits<float>::max()`
In Windows, (float)(1e+300) is used for INFINITY, which causes compiler
error in Visual Studio 2022 v17.12 Preview 5.
* Rename `NumericLimits<T>::Min` to Lowest to be consistent with
std::numeric_limits
* Fix topk implementation: use `NumericLimits<CudaT>` instead of
`NumericLimits<T>` in kernel. That could avoid defining a confusing
defintion of `NumericLimits<MLFloat16>` that returns half instead of
MLFloat16.
* Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly,
which is faster than converting float to half.

Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3
and __nv_fp8_e5m2 right now.

### Motivation and Context
#22728
Replace INFINITY by `std::numeric_limits<float>::infinity()` to avoid
build errors with Visual Studio 2022 v17.12 Preview 5

### Motivation and Context
#22728
… default (#23368)

Makes the QNN provider option `offload_graph_io_quantization` enabled by
default. It was previously disabled by default.

Enabling this option significantly decreases inference latency for many
models.
### Description
Add a temporary path to RN 0.69.3 to update the boost url


### Motivation and Context
Fix the React-native CI until we update the RN to 0.70.15 or 0.73.3+
versions
@snnn
Copy link
Contributor

snnn commented Feb 5, 2025

Please also cherry-pick a25c931 and a910ced so that orttraining-linux-ci-pipeline will pass.

Changming Sun and others added 5 commits February 5, 2025 12:08
Move ORT Training pipeline to github actions and enable CodeQL scan for the code(including inference code).
We will move all pull request pipelines to Github Actions.
### Description
Move Linux Github actions to a dedicated pool. Currently the
"orttraining-linux-ci-pipeline " is too slow.

### Motivation and Context
To speed up the running.
Fix some inconsistency. 

All our iOS build should target iOS 15.1.
All our macOS desktop build should target macOS 13.3 to align with the
changes made in #17361
### Description
<!-- Describe your changes. -->



### Motivation and Context
ESRP code-sign task has supported .net 8, so we can remove macos-12
### Description
Removing net8.0-android target from the MAUI test project. This test is
not currently being run in the pipelines, but build issues is blocking
the patch release.

[Pipeline
run](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=674060&view=results)
(Still running into unrelated Failure to get the
"onnxruntimepackagetest" image errors)
@adrianlizarraga adrianlizarraga marked this pull request as ready for review February 7, 2025 00:14
@adrianlizarraga adrianlizarraga requested review from a team, HectorSVC, jchen351 and snnn February 7, 2025 00:14
@adrianlizarraga
Copy link
Contributor Author

/azp run Linux GPU CI Pipeline, orttraining-linux-gpu-ci-pipeline

@azure-pipelines
Copy link

Azure Pipelines failed to run 1 pipeline(s).

@jywu-msft jywu-msft merged commit 8608bf0 into rel-1.20.2 Feb 10, 2025
153 of 161 checks passed
@jywu-msft jywu-msft deleted the adrianl/rel-1.20-2-cherry-pick-1 branch February 10, 2025 16:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants