Backmerging with Msft commits #624

jatinwadhwa921 · 2025-03-20T15:15:46Z

Backmerging with Msft commits

…t#24065) ### Description add bool support to EPContext schema to unblock some models

### Error ```Traceback /onnxruntime/onnxruntime/core/providers/webgpu/reduction/reduction_ops.cc:146 [allow_multi_axes = true] Axes values must be in the range [-rank, rank-1]. Got: 446098880 ```

### Description Upgrade current MacOS-13 to 14 ### Motivation and Context  - [x] Update the RN to 0.73.x+ to have the newer version of boost --------- Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

### Description  Abs and Sign had bfloat16 kernels created but not registered with the CUDA EP. Additionally Sign bfloat16 didn't work. * register bfloat16 kernels with CUDA EP * fix incorrectly named macro by adding 'X' as they add bfloat16 registration * add specialization for bfloat16 to _Sign * copied existing pattern. not sure if there's a better way * update tests ### Motivation and Context  microsoft#23875

…soft#24086) ### Description Improve the OrtValue interface typing and changed `staticmethod` to `classmethod` for constructors to follow python conventions (https://google.github.io/styleguide/pyguide.html#2174-decision).

…icrosoft#24078) The DP4AMatMulQuantize shader needs to make sure that K is divisible by 128. Otherwise, we need align the scale to have shape [M, ceil(K / 128)]. To simplify the shader, we limit that K must be divisible by 128 to apply dp4a matmul.

### Description Add macOS ARM64 pipeline for webgpu. This pipeline is a temporary one. I created this pipeline because the current code already fails on macOS ARM64 for WebGPU EP. Adding this pipeline allows to check the status of the fix, and eventually when the build passes, this pipeline will be merged with the existing macOS arm64 pipeline.

…crosoft#23998) - Renamed all conflicting WebNN methods from `jsep*` to `webnn*`. - WebNN doesn't need flush(), therefore it doesn't need to set `jsepBackend`. This PR addresses issue microsoft/webnn-developer-preview#78

### Description Enables multithreading on FP16 to FP32 cast operator. ### Motivation and Context Improves CPU performance on FP16 models that require casting to FP32.

### Description Move Android CI Pipeline to Github Actions

…#23490) ### Description Cleanup CoreML EP's code to remove the COREML_ENABLE_MLPROGRAM macro. Also, increase MINIMUM_COREML_VERSION(first version we support) to 5 .

…olve warning (microsoft#23847) ### Description Removes namespace from AndroidManifest.XML ### Motivation and Context - Resolves microsoft#21681

### Description Use custom implementation for Pow to fix test failures.

…microsoft#24091) ### Description  There are still some timeout for the pipeline. further extend the timeout to 90 minutes for ARM64-Xcode16-targeting-iphonesimulator. It takes quite a while if all build cache is missing. ### Motivation and Context The pipeline sometimes failed because of timeout. There is a previous PR microsoft#24030 to increase the timeout from 60min to 75 min but it looks like not enough.

…ft#24108) ### Description fix test failure in Reduce operators on macOS ARM64 ``` [E:onnxruntime:ReduceL1, sequential_executor.cc:572 ExecuteKernel] Non-zero status code returned while running ReduceL1 node. Name:'node1' Status Message: webgpu_context.cc:259 Run Uniform variable[0] (output_size) data type mismatch in program "ReduceL1", Expected: u32, Actual: i32 ```

Increases WebGPU EP op coverage.

This PR uses 1d disptach group size and uses workgroup_idx instead of workgroup.x|workgroup.y in case they are normalized.

) ### Description abs_error is slightly loosen from 0.02 to 0.03 to allow test cases on macOS arm64 to pass.

### Description  * Add Sum to op builder in QNN-EP * Now we can limit the support to Sum with 2 inputs. ### Motivation and Context  * Enhance QNN-EP support for Sum with two inputs

…ing (microsoft#24059) Remove redundant header files BTW.

HectorSVC and others added 22 commits March 18, 2025 08:22

add bool support to EPContext schema to unblock some models (microsof…

a46d212

…t#24065) ### Description add bool support to EPContext schema to unblock some models

[WebGPU EP] fix for reduce min/max error on MacOS CI (microsoft#24077)

b3aa5a3

### Error ```Traceback /onnxruntime/onnxruntime/core/providers/webgpu/reduction/reduction_ops.cc:146 [allow_multi_axes = true] Axes values must be in the range [-rank, rank-1]. Got: 446098880 ```

Enable multithreading on FP16 to FP32 cast operator (microsoft#23619)

7fc7d5e

### Description Enables multithreading on FP16 to FP32 cast operator. ### Motivation and Context Improves CPU performance on FP16 models that require casting to FP32.

Move Android CI Pipeline to Github Actions (microsoft#24094)

3488ba3

### Description Move Android CI Pipeline to Github Actions

Cleanup CoreML EP's code to remove COREML_ENABLE_MLPROGRAM (microsoft…

7444fee

…#23490) ### Description Cleanup CoreML EP's code to remove the COREML_ENABLE_MLPROGRAM macro. Also, increase MINIMUM_COREML_VERSION(first version we support) to 5 .

webgpu ep support for argmax/argmin (microsoft#24089)

b626409

[mobile/reactnative] Remove namespace from AndroidManifest.XML to res…

d8ed4da

…olve warning (microsoft#23847) ### Description Removes namespace from AndroidManifest.XML ### Motivation and Context - Resolves microsoft#21681

[WebGPU EP] fix implementation of Pow (microsoft#24088)

80441e4

### Description Use custom implementation for Pow to fix test failures.

[WebGPU EP] Implements CumSum Operator (microsoft#24047)

8d21bf7

Increases WebGPU EP op coverage.

[webgpu] Use 1d dispatch group size (microsoft#24084)

81a8920

This PR uses 1d disptach group size and uses workgroup_idx instead of workgroup.x|workgroup.y in case they are normalized.

[WebGPU] fix test failure in MatMulNBits on macOS ARM64 (microsoft#24109

9dcb99c

) ### Description abs_error is slightly loosen from 0.02 to 0.03 to allow test cases on macOS arm64 to pass.

[WebNN] Replace narrow with SafeInt for consistently in integer handl…

5d43f0a

…ing (microsoft#24059) Remove redundant header files BTW.

Merge branch 'master' into sync_msft_20_3_25

9ee95d1

jatinwadhwa921 requested a review from ankitm3k March 20, 2025 15:15

ankitm3k approved these changes Mar 20, 2025

View reviewed changes

jatinwadhwa921 merged commit 2a24806 into ovep-develop Mar 21, 2025
6 of 11 checks passed

jatinwadhwa921 deleted the sync_msft_20_3_25 branch April 15, 2025 05:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Backmerging with Msft commits #624

Backmerging with Msft commits #624

Uh oh!

jatinwadhwa921 commented Mar 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

Backmerging with Msft commits #624

Backmerging with Msft commits #624

Uh oh!

Conversation

jatinwadhwa921 commented Mar 20, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants