Merged
267 commits
5ab953c
Quant tool: Add `nodes_to_exclude` in `get_qnn_qdq_config` (#23779)
jambayk Feb 27, 2025
0564265
[ORT/CI_Pipeline] Use --enable_generic_interface in ORT builds for EP…
karim-vad Feb 27, 2025
a189bfc
Increase npm package pipeline ReactNative_CI_iOS timeout to 120 mins …
jchen351 Feb 27, 2025
c61a4b1
[Mlas] Unblock hardcoded matmul blocking size (#23815)
fajin-corp Feb 27, 2025
2a4cfab
Revert changes onn mac-react-native-ci-pipeline.yml (#23845)
jchen351 Feb 27, 2025
1be64f8
Fix flash attention for GQA (Phi4) (#23850)
sushraja-msft Feb 28, 2025
1088a1e
Model Builder API (#23223)
skottmckay Feb 28, 2025
1ffe793
Fix typo: change `Upample` to `Upsample`. (#23838)
seungtaek94 Mar 1, 2025
0a6b05f
[doc] Fix typos in csharp/src/Microsoft.ML.OnnxRuntime/ (#23848)
co63oc Mar 1, 2025
daf9565
Quant tool: Consistent `get_qdq_config` and `get_qnn_qdq_config` beha…
jambayk Mar 2, 2025
99c51a3
Change the logic to generate the default ep context file name (#23788)
HectorSVC Mar 3, 2025
7f0c2c6
Make Nuget QNN package pipeline 1ES compliant (#23805)
jchen351 Mar 3, 2025
1872527
[js/common] allows using Uint16Array as data for float16 tensor (#23827)
fs-eire Mar 3, 2025
325ee30
[js/webgpu] Reland the optimization of ConvTranspose (#23858)
qjia7 Mar 4, 2025
30c6825
[OpenVINO] Fix a build warning (#23877)
asoldano Mar 4, 2025
bde4fbe
Change gsl::byte to std::byte (#23872)
Mar 4, 2025
17dcea7
Allow using extended minimal build for several EPs (#23834)
fs-eire Mar 4, 2025
813bdaa
Add dawn to ThirdPartyNotices (#23876)
fs-eire Mar 4, 2025
9d0dc9f
Enable QNN EP weight sharing generation using public API (#23702)
HectorSVC Mar 5, 2025
788ca51
[QNN-EP]: Fix inference failures while running with htp_shared_memory…
quic-ashigarg Mar 5, 2025
8aed920
Fix enable_pix_capture build for WebGPU (#23857)
jchen10 Mar 5, 2025
834adde
[WebGPU-EP Native] Add ReduceMean (#23860)
satyajandhyala Mar 5, 2025
cfb0a72
[WebGPU EP] introduce BiasAdd contrib op (#23861)
prathikr Mar 5, 2025
5e636a6
Dynamo export and improve benchmark script for SAM2 encoder (#23887)
tianleiwu Mar 5, 2025
aafa8d1
[js/web] improve workaround for bundlers (#23902)
fs-eire Mar 6, 2025
d35db9b
[webgpu] Restore MatMulNBits workgroup size for Phi-3.5 (#23349)
daijh Mar 6, 2025
95225dd
[webgpu] support Pad operator (#23141)
xhcao Mar 6, 2025
b524229
[WebNN] Accept Float16Array for float16 data type if it is available …
Honry Mar 6, 2025
996fffb
Ensure that the 'cmake_minimum_required' is version 3.5 or greater (#…
mschofie Mar 6, 2025
54b2d64
WebGPU: Remove deprecated subgroups-f16 from WebGPU native and JS EP …
jiangzhaoming Mar 6, 2025
ccf8fdd
[JSEP/WebGPU] Fixed error in softmax dispatch. (#23906)
satyajandhyala Mar 6, 2025
101353c
enable WebGPU EP in WebAssembly build (#23913)
fs-eire Mar 6, 2025
8f07743
Adding OpenVINO Windows CI Pipeline (#23919)
yihonglyu Mar 7, 2025
4bb79d1
[WebGPU EP] SoftMax Implementation (#23538)
vraspar Mar 7, 2025
b2ab87e
Exclude MAUI projects from GPU C# packaging builds (#23923)
skottmckay Mar 7, 2025
eeaf73b
Support all block sizes that are multiples of 32 for DP4A (#23907)
sushraja-msft Mar 7, 2025
c28bf78
Example custom op with output type inferencing (#23916)
skottmckay Mar 7, 2025
1199dc0
Enabling L2+ Optimizations for EPs (#23517)
chilo-ms Mar 7, 2025
2ba076a
fix binplace file in web pipeline (#23930)
fs-eire Mar 7, 2025
e47c6c1
Updated run_CIs_for_external_pr.py to support the Windows OpenVINO CI…
yihonglyu Mar 7, 2025
8969ee7
Fix ConvInteger handling of optional inputs. (#23935)
skottmckay Mar 7, 2025
26f590b
Updated ov version in pipeline (#595) (#23882)
saurabhkale17 Mar 7, 2025
f25deae
[AIX] External data handling (#23859)
ranjitshs Mar 7, 2025
593d5c0
Create a packaging pipeline for a custom nuget package (#23918)
baijumeswani Mar 7, 2025
7dbbfe0
Fix license in example test code. (#23936)
skottmckay Mar 7, 2025
ab38607
replace usage of gsl::narrow and gsl::narrow_cast in WebGPU EP (#23926)
fs-eire Mar 7, 2025
cffef2e
VCPKG improvement: set VCPKG_OSX_DEPLOYMENT_TARGET (#23933)
Mar 7, 2025
49328fe
Allow using a different version of flatbuffers when building with vcp…
Mar 7, 2025
95dcd15
Make python package pipeline 1ES compliant (#23800)
jchen351 Mar 8, 2025
989d417
Delete ROCM Nuget Publishing Pipeline (#23948)
jchen351 Mar 8, 2025
fe7634e
Bump SixLabors.ImageSharp from 2.1.9 to 2.1.10 in /csharp/sample/Micr…
dependabot[bot] Mar 10, 2025
246c219
Make python CUDA package pipeline 1ES compliant (#23802)
jchen351 Mar 10, 2025
773bb4f
Migrate yarn to npm (#22116)
jchen351 Mar 10, 2025
333fbdb
[WebGPU/JSEP] Support group query attention do_rotary attribute (#23524)
satyajandhyala Mar 10, 2025
f18e9fa
Fix npm audit in js/react-native/e2e (#23975)
jchen351 Mar 11, 2025
6443626
Suppress some warnings in WebGPU EP generated by GCC 13 (#23984)
fs-eire Mar 11, 2025
d010acb
Fix NPM audit in js/react-native (#23974)
jchen351 Mar 11, 2025
9118b1d
Bump axios from 1.7.9 to 1.8.2 in /js/node (#23963)
dependabot[bot] Mar 11, 2025
5672cf7
GCC 14: fix insert_or_assign() call (#23955)
stefantalpalaru Mar 12, 2025
d2bf9a7
ADD emsdk env vars to VCPKG_KEEP_ENV_VARS (#23997)
Mar 12, 2025
fe43537
Fix ONNX Runtime Python Test Pipeline (#23990)
jchen351 Mar 12, 2025
16d6f39
[webgpu] Fix the continuation issue (#23999)
qjia7 Mar 12, 2025
9891eb3
[WebGPU EP] Implements Gelu, BiasSplitGelu, and QuickGelu (#23981)
prathikr Mar 12, 2025
6dd6ef9
[Native WebGPU] Added ReduceMax and ReduceSum (#23934)
satyajandhyala Mar 12, 2025
47bd046
Convert Windows CPU CI Pipeline to Github Actions (#23996)
Mar 12, 2025
06482c2
[Fix] Dependencies find_package Eigen error (#23939)
mingyueliuh Mar 12, 2025
5e05729
Update onnxruntime_c_api.h to work with MinGW (#24006)
hsilm Mar 12, 2025
57ddd02
Add DNNL github workflow (#24011)
Mar 13, 2025
7ae606f
Qnn weight sharing improvement (#23945)
HectorSVC Mar 13, 2025
11216a4
Correct generated cmake syntax (#24016)
hans00 Mar 13, 2025
1362e7c
[webgpu] allow to specify UseIndicesTypeAlias for Indices (#24019)
fs-eire Mar 13, 2025
401f24a
[webgpu] allow overloads to Program::AddIndices (#24021)
fs-eire Mar 13, 2025
219c919
fix test for RotaryEmbedding (#24022)
fs-eire Mar 13, 2025
99b78a9
Fix attention bias broadcast (#24017)
tianleiwu Mar 13, 2025
5bd3163
Remove unused parameter in csharp InferenceTest (#24031)
tianleiwu Mar 13, 2025
6bb6d79
[TensorRT EP] Call cudaSetDevice at compute function for handling mul…
chilo-ms Mar 13, 2025
3f71d63
Increase timeout for ARM64-Xcode16-targeting-iphonesimulator (#24030)
edgchen1 Mar 13, 2025
1fc6d8c
Support tvOS build (#24000)
hans00 Mar 14, 2025
cb3f631
[TensorRT EP] Stop enforcing oss parser during Windows debug build (#…
yf711 Mar 14, 2025
9a296a0
Set CMAKE_POLICY_DEFAULT_CMP0069 to NEW to ensure that IPO flags are …
edgchen1 Mar 14, 2025
9f21456
Make Cuda packaging pipeline 1ES compliant (#23806)
jchen351 Mar 14, 2025
7c05e7f
[webgpu/wasm] allow runtime switch between WebGPUEP and JSEP (#24032)
fs-eire Mar 14, 2025
c9c8b48
Move call to MLAS_CPUIDINFO::GetCPUIDInfo() out of MlasSQNBitGemmDisp…
edgchen1 Mar 14, 2025
cc5840b
[webgpu] fix the wrong dispatch size in flash_attention (#24020)
xhcao Mar 14, 2025
41c239d
avoid copy unnecessary files for nodejs pkg (#23992)
fs-eire Mar 14, 2025
5a694bc
Add support for custom position ids and attention bias to GQA CPU ope…
derdeljan-msft Mar 14, 2025
73d9826
[WebNN] Better int64 integration (#23831)
Honry Mar 14, 2025
b896666
Convert Windows GPU pipelines and Windows OpenVino pipeline to Github…
Mar 14, 2025
f22ee08
[ARM CPU] Fix fp16 const initialization on no-fp16 platform (#23978)
fajin-corp Mar 14, 2025
ae501ee
[Native WebGPU EP] Add packedQKV and do_rotary attribute support to G…
satyajandhyala Mar 14, 2025
7942fa7
Whisper Redesigned Solution (#23549)
kunal-vaishnavi Mar 15, 2025
5ef0d21
Windows: Show more useful DLL load errors to say exactly what DLL is …
RyanUnderhill Mar 16, 2025
2bc73ca
Extend CMAKE_CUDA_FLAGS with all Blackwell compute capacity (#23928)
yf711 Mar 17, 2025
f5812d0
[WebGPU] Reduce staging buffers for uploading intializers (#23968)
jchen10 Mar 17, 2025
154e3b7
[WebGPU EP] Implement Remaining Reduction Ops (#24045)
prathikr Mar 17, 2025
a46d212
add bool support to EPContext schema to unblock some models (#24065)
HectorSVC Mar 18, 2025
b3aa5a3
[WebGPU EP] fix for reduce min/max error on MacOS CI (#24077)
prathikr Mar 18, 2025
e495750
Upgrade current MacOS-13 to 14 (#23293)
jchen351 Mar 18, 2025
c6a2675
Fix CUDA EP Abs and Sign bfloat16 support (#23914)
skottmckay Mar 18, 2025
12fea57
Improve typing for OrtValue and other public Python interfaces (#24086)
justinchuby Mar 18, 2025
a85977d
[webgpu] Limit that K must be divisible by 128 to apply dp4a matmul (…
qjia7 Mar 18, 2025
d98046b
Add macOS ARM64 pipeline for webgpu (#24060)
fs-eire Mar 19, 2025
eceae8b
[WebNN/WebGPU JS] Fix shared Module methods overriding each other (#2…
egalli Mar 19, 2025
7fc7d5e
Enable multithreading on FP16 to FP32 cast operator (#23619)
Mar 19, 2025
3488ba3
Move Android CI Pipeline to Github Actions (#24094)
Mar 19, 2025
7444fee
Cleanup CoreML EP's code to remove COREML_ENABLE_MLPROGRAM (#23490)
Mar 19, 2025
b626409
webgpu ep support for argmax/argmin (#24089)
guschmue Mar 19, 2025
d8ed4da
[mobile/reactnative] Remove namespace from AndroidManifest.XML to res…
carzh Mar 19, 2025
80441e4
[WebGPU EP] fix implementation of Pow (#24088)
fs-eire Mar 19, 2025
731b27e
Increase timeout to 90min for ARM64-Xcode16-targeting-iphonesimulator…
fs-eire Mar 19, 2025
da7874c
[WebGPU] fix test failure in Reduce operators on macOS ARM64 (#24108)
fs-eire Mar 19, 2025
8d21bf7
[WebGPU EP] Implements CumSum Operator (#24047)
prathikr Mar 19, 2025
81a8920
[webgpu] Use 1d dispatch group size (#24084)
qjia7 Mar 19, 2025
9dcb99c
[WebGPU] fix test failure in MatMulNBits on macOS ARM64 (#24109)
fs-eire Mar 20, 2025
4d5e274
[QNN-EP] Add support for Sum operator with 2 inputs (#24098)
chuteng-quic Mar 20, 2025
5d43f0a
[WebNN] Replace narrow with SafeInt for consistently in integer handl…
Honry Mar 20, 2025
6bdbf08
[QNN-EP] Add Lora Support with offline QNN context binary (#24026)
chuteng-quic Mar 20, 2025
440d17a
[TensorRT EP] support TensorRT 10.9-GA (#23905)
yf711 Mar 20, 2025
127c850
[webgpu] Apply dp4a for generation shader (#24064)
qjia7 Mar 20, 2025
db0c95c
[CUDA] Support slide window in cutlass fused attention (#24072)
tianleiwu Mar 20, 2025
16b0b32
[MIGraphX EP] rename HIPPinnedAllocator to MIGraphXPinnedAllocator (#…
apwojcik Mar 20, 2025
9922d48
[MIGraphX EP] check POLICY CMP0144 availability before used (#24104)
apwojcik Mar 20, 2025
469fb7e
[JSEP] handles edge case in gridsample operator (#24121)
prathikr Mar 20, 2025
49024a1
[OpenVINO]Session Options Appended After AppendExecutionProvider (#23…
sfatimar Mar 20, 2025
7a6514c
[webgpu]Add MaxPool and AveragePool (#23714)
jchen10 Mar 21, 2025
9e53afa
[webgpu EP] put GetMaxComponents and SumVector to one place. (#24122)
fs-eire Mar 21, 2025
dcc1f5a
skip MOE python test when MPI is not installed (#24116)
tianleiwu Mar 21, 2025
90c5ffb
Integrate KleidiAI for MatMulNBits via MlasQNBitGemm (#23627)
MichaelTylerArm Mar 21, 2025
0a363d9
add test cases for webgpu ep in web (#24117)
fs-eire Mar 21, 2025
cd9406b
Refactor Webnn IsSupported*() to use constant initializers. (#24118)
yuslepukhin Mar 21, 2025
4959468
Deleted the constant SKIP_CUDA_TEST_WITH_DML (#24113)
CodingSeaotter Mar 22, 2025
d84314c
Update T5 Onnx Export and Optimization (#23949)
tianleiwu Mar 23, 2025
3012d44
Update package.json to make the dist avaliable again (#23991)
jchen351 Mar 23, 2025
2b3d7fb
Fix attention QK linkage error (#24134)
kunal-vaishnavi Mar 24, 2025
5ed900e
Bump next from 15.1.2 to 15.2.3 in /js/web/test/e2e/exports/testcases…
dependabot[bot] Mar 24, 2025
2b5c9da
[Shape Inference] Add shape inference for QLinearAdd and QLinearMul o…
pravg-amd Mar 24, 2025
8eb8c2b
[mobile] Add Android NuGet BrowserStack test to NuGet packaging pipel…
carzh Mar 24, 2025
828e372
[CPU] Add fp16 support to sparse attention (#24015)
fajin-corp Mar 24, 2025
373b9e2
refactor mac CI pipelines (#24138)
fs-eire Mar 24, 2025
5244d68
Address Windows CUDA build issue (#24149)
yuslepukhin Mar 24, 2025
e03631e
[webgpu] add option to perserve device and enable in unittest (#24115)
fs-eire Mar 24, 2025
78d91cd
[js/web] allow bundler import condition for not bundling wasm (#24014)
fs-eire Mar 24, 2025
618aef7
[js] Add API for accessing metadata of a model's input/output (#23937)
fs-eire Mar 24, 2025
afaf4a5
add cache "onnxnodetests" for node tests (#24150)
fs-eire Mar 25, 2025
ce65e25
[Native WebGPU] Add Matmul (#24046)
vraspar Mar 25, 2025
bb005b9
Upgrade Big Model pipeline CUDA from 11.8 to 12.x (#24156)
tianleiwu Mar 25, 2025
de502c8
Proper Error Message when fp16 model is used for Beam Search in CPU (…
tianleiwu Mar 25, 2025
a4b8f11
Change type len from int to size_t (#24157)
jiafatom Mar 25, 2025
a8fb786
Limit the Pipeline ability to build cuda 11 (#24073)
jchen351 Mar 25, 2025
8680667
Move Linux CPU CI pipeline to Github Actions (#24154)
Mar 25, 2025
d9c961c
Bump vite from 6.2.1 to 6.2.3 in /js/web/test/e2e/exports/testcases/v…
dependabot[bot] Mar 25, 2025
1ef3044
[onnxruntime_perf_test] Fix custom_allocator_ destruction order. (#24…
edgchen1 Mar 25, 2025
25b06f2
Fix layout transformer for FusedConv (#24169)
fs-eire Mar 25, 2025
1f6dc88
Migrate Zip-Nuget Package Pipeline to 1ES (#23609) Also, kleidail is …
jchen351 Mar 25, 2025
9dbfee9
Update the min GCC version (#24148)
Mar 25, 2025
2a800d1
[QNN EP] ARM64EC python package remove --vcpkg in build (#24174)
jywu-msft Mar 26, 2025
a8673c6
[WebGPU EP] Add GEMM implementation (#24023)
xiaofeihan1 Mar 26, 2025
513e8de
[wasm] remove --vcpkg in wasm build (#24179)
fs-eire Mar 26, 2025
32b376c
revise mac os pipeline to reduce the amount of jobs (#24177)
fs-eire Mar 26, 2025
be1cfc4
fix triggering for "Validate Gradle Wrapper" pipeline (#24181)
fs-eire Mar 26, 2025
5d805c2
upgrade QNN to version 2.32.0.250228 (#23977)
HectorSVC Mar 26, 2025
24ece47
[JSEP] adjust edge case logic for scatternd (#24172)
prathikr Mar 26, 2025
1f70fc2
Make the custom nuget packaging pipeline 1ES commpliant. (#24191)
baijumeswani Mar 27, 2025
4d13b70
Disable KleidiAI in Python Packaging pipeline MacOS build (#24194)
edgchen1 Mar 27, 2025
041674a
Rolling back the python/cuda (#24170)
jchen351 Mar 27, 2025
914be22
Remove all CG template from pipelines (#24193)
jchen351 Mar 27, 2025
bd00c39
Move Linux ARM64 CI pipeline and Linux DNNL CI pipeline to Github Act…
Mar 27, 2025
86b4c78
[webgpu-ep] Fix test_batchnorm_example (#24184)
jchen10 Mar 27, 2025
2656671
Further reduce work load for Mac CI pipeline (#24197)
fs-eire Mar 27, 2025
64b0d07
Generate unique names for SliceSplit fusion. (#24217)
yuslepukhin Mar 27, 2025
2592147
Fix the pipeline that failed because of vcpkg (#24226)
fs-eire Mar 28, 2025
c756e0a
Improve Shape Inference for GQA (#24143)
peishenyan Mar 28, 2025
19d8d69
Add React Native namespace back in for iOS (#24218)
carzh Mar 28, 2025
180ba8f
RoPE fp16 avx (#23772)
liqunfu Mar 28, 2025
f430dce
Migrate Linux GPU pipelines to Github Actions (#24232)
Mar 28, 2025
41dde35
Migrate Web CI into github actions (#24219)
fs-eire Mar 28, 2025
4a669fd
update the readme doc for the tool ep_weight_sharing_ctx_gen (#24233)
HectorSVC Mar 29, 2025
7ef0ddc
[WebGPU EP] If Implementation for WebGPU EP (#24242)
prathikr Mar 29, 2025
8de342a
Update linux-dnnl.yml: rename the pipeline (#24240)
Mar 29, 2025
d71aa4d
[webgpu] Fix test_layer_normalization_2d_axis0 (#24223)
jchen10 Mar 29, 2025
f1d790c
[webgpu] fix LayerNorm with empty input (#24244)
fs-eire Mar 29, 2025
492af7a
Bump actions/setup-python from 4 to 5 (#24251)
dependabot[bot] Mar 31, 2025
83650ed
Bump actions/cache from 3 to 4 (#24250)
dependabot[bot] Mar 31, 2025
22787ae
[QNN EP] Add platform-agnostic EP option to specify QNN backend, `bac…
edgchen1 Mar 31, 2025
ad2e565
[webgpu] Fix opset-12 softmax nhwc issue (#24227)
xhcao Mar 31, 2025
528f29a
Extend pyright exclude list in pyproject.toml (#24246)
fs-eire Mar 31, 2025
ba2999c
[js/web] Add Wasm Relaxed SIMD support to wasm backend (#22794)
jing-bao Mar 31, 2025
4eeefd7
Add shader key validation step in WebGPU CI pipeline (#24243)
fs-eire Mar 31, 2025
30115cf
upgrade dawn version to 4cb1f9be152a4fa6bb695c08cd707ab078a1e2fb (#24…
fs-eire Mar 31, 2025
5982430
Bump dsaltares/fetch-gh-release-asset from 1.1.0 to 1.1.2 (#24248)
dependabot[bot] Mar 31, 2025
e227415
Bump vite from 6.2.3 to 6.2.4 in /js/web/test/e2e/exports/testcases/v…
dependabot[bot] Apr 1, 2025
5068ab9
[WebGPU EP] fixes bugs in split implementation (#24259)
prathikr Apr 1, 2025
1b48cc4
Bump microsoft/onnxruntime-github-actions from 35f8bd42417991aa46577e…
dependabot[bot] Apr 1, 2025
5b08055
Update xcode and iphoneSimulatorVersion after MacOS-14 (#24260)
jchen351 Apr 1, 2025
24620e7
Exclude onnxruntime-inference-examples directory from Component Gover…
jchen351 Apr 1, 2025
67216c8
[VitisAI] Fixed include error. (#24199)
BoarQing Apr 2, 2025
a5bc69c
Migrate pull:wasm to github action (#24269)
fs-eire Apr 2, 2025
b379390
Ensure to use correct GPU device in RunSince when it's invoked by new…
chilo-ms Apr 2, 2025
b5d15bc
Adding build-system to pyproject.toml (#24216)
jchen351 Apr 2, 2025
bc7b07d
[WebGPU EP] Implements ceil mode for Average Pool (#24270)
prathikr Apr 2, 2025
55aa03c
Pin vcpkg version (#24284)
Apr 3, 2025
a14d586
Support load TensorRT V3 plugin (#24211)
toothache Apr 3, 2025
21db38c
Expose TRT preview features as EP option (#24212)
toothache Apr 3, 2025
8465ca3
[webgpu] test_layer_normalization_3d_axis0_epsilon (#24276)
jchen10 Apr 3, 2025
7a55188
[webgpu][dawn API optimization] reduce number of calls to wgpuDeviceH…
fs-eire Apr 3, 2025
d238813
Bump next from 15.2.3 to 15.2.4 in /js/web/test/e2e/exports/testcases…
dependabot[bot] Apr 3, 2025
cbaa8bc
Bump image-size from 1.1.1 to 1.2.1 in /js/react_native/e2e (#24278)
dependabot[bot] Apr 3, 2025
a28da4b
[QNN-EP] Enhance QNN-EP support for Softmax with opset < 13. (#24180)
zhaoxul-qti Apr 3, 2025
e5e906e
Update publish-nuget.yml to correct feed. (#24299)
jchen351 Apr 4, 2025
3dfc2ae
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance (#2…
daijh Apr 4, 2025
82c8e56
upgrade action shellcheck to v1.30.0 (#24304)
fs-eire Apr 4, 2025
1cb53d0
[QNN-EP] Fix ONNX context model helper. (#24271)
minfhong-quic Apr 4, 2025
318cc87
[WebGPU] fix Pad cache key (#24305)
fs-eire Apr 4, 2025
56f1018
Bump vite from 6.2.4 to 6.2.5 in /js/web/test/e2e/exports/testcases/v…
dependabot[bot] Apr 4, 2025
2e94c5a
[WebGPU] fix cache key of AttentionProbs/VxAttentionScore (#24309)
fs-eire Apr 4, 2025
e944379
Support Gemma3 with Clip fused attention (#24280)
titaiwangms Apr 4, 2025
11fda2a
Update packaging pipeline for Nodejs binding (#24301)
fs-eire Apr 4, 2025
a4976e3
Add support for uint8_t as data type for GatherBlockQuantized (#24239)
sushraja-msft Apr 4, 2025
9102aae
[Native WebGPU] Add Conv, ConTranspose and FusedConv (#24186)
satyajandhyala Apr 4, 2025
a7e62d6
[webgpu][dawn API optimization] reduce number of calls to wgpuDeviceG…
fs-eire Apr 4, 2025
55c1a3b
Fix 'minimal_power' to 'minimum_power' for DirectML performance selec…
virajwad Apr 5, 2025
d6df4f2
Add ConvTranspose cache key (#24317)
satyajandhyala Apr 5, 2025
a1186f6
[webgpu] Use 1D dispatch groups for attention (#24228)
qjia7 Apr 7, 2025
73676fc
[webgpu][dawn API optimization] reduce number of calls to buffer APIs…
fs-eire Apr 7, 2025
350d140
Implement load cancellation ability (#24257)
yuslepukhin Apr 7, 2025
ca1b32d
[webgpu] Fix ROUND_PREFER_CEIL issue of Resize operator (#24229)
xhcao Apr 7, 2025
b803429
[Native WebGPU] Exclude WebGPU EP from ConvFp16 3D tests. (#24327)
satyajandhyala Apr 7, 2025
554fb4a
[VitisAI EP] export InferShapes to VitisAIEP (#23881)
zz002 Apr 8, 2025
18f91e5
[webgpu] Flash attention for generation (#23808)
qjia7 Apr 8, 2025
04e0b50
Use WASM f32x4 relaxed min/max for relaxed simd build (#24324)
fanchenkong1 Apr 8, 2025
f83e661
webgpu support for DequantizeLinear (#24268)
guschmue Apr 8, 2025
10e51d2
[webgpu] fix the reflect mode issue of Pad (#24202)
xhcao Apr 8, 2025
4edada6
Remove explicit batch network flag for TRT 10+ (#24298)
kevinch-nv Apr 8, 2025
2265613
[webgpu] Fix bias_split_gelu (#24342)
jchen10 Apr 8, 2025
34abb8b
[webgpu] fix bias-add (#24336)
jchen10 Apr 8, 2025
0acb048
[webgpu] optimize SkipLayerNormalization operator (#24164)
xhcao Apr 8, 2025
d7a38a5
ROCm: Remove -Wno-interference-size compiler flag (#24326)
jagadish-amd Apr 8, 2025
39e585f
[web] revise flag `ort.env.wasm.simd` (#24314)
fs-eire Apr 8, 2025
311f771
Cherry-picks into rel-1.21.0 (#23846)
amarin16 Feb 28, 2025
5f2d93b
[OVEP] Config 1 Commits
jatinwadhwa921 Feb 28, 2025
72bf43b
Stateful PoC: Perform stateless -> stateful conversion of ov::Model
RyanMetcalfeInt8 Mar 26, 2025
f4cc9f3
update: Update enable_causallm provider option for stateful poc
ankitm3k Apr 3, 2025
cfbfc0c
fix: Refactor Stateful Code
ankitm3k Apr 3, 2025
1ba8b20
update: Refactor, fix & enable EPContext Import for XML & BIN
ankitm3k Apr 4, 2025
c858abc
Support chat-mode for NPU
RyanMetcalfeInt8 Apr 7, 2025
a0c11c7
Support KVCache rewind for stateful LLMs via SetEpDynamicOptions
RyanMetcalfeInt8 Apr 7, 2025
7a8dbe2
ov_interface.cc: Fix compilation errors in Debug build (#642)
RyanMetcalfeInt8 Apr 8, 2025
1ff02d1
fix: disable ep_sdk_version check (#645)
ankitm3k Apr 8, 2025
360ddf5
fix: Refactor read_model() to accept XML path (#646)
ankitm3k Apr 9, 2025
dd7cb8f
fix: resolve merge conflicts
ankitm3k Apr 10, 2025
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/03-mobile.yml
Original file line number Diff line number Diff line change
@@ -131,6 +131,6 @@ body:
     id: ep-version
     attributes:
       label: Execution Provider Library Version
-      placeholder: ex. CUDA 11.6 or ROCm 5.1.1
+      placeholder: ex. CUDA 12.2 or ROCm 5.1.1
     validations:
       required: false
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/05-performance.yml
@@ -127,7 +127,7 @@ body:
     id: ep-version
     attributes:
       label: Execution Provider Library Version
-      placeholder: ex. CUDA 11.6 or ROCm 5.1.1
+      placeholder: ex. CUDA 12.2 or ROCm 5.1.1
     validations:
       required: false
   - type: textarea
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/06-training.yml
@@ -72,6 +72,6 @@ body:
     id: ep-version
     attributes:
       label: Execution Provider Library Version
-      placeholder: ex. CUDA 11.6 or ROCm 5.1.1
+      placeholder: ex. CUDA 12.2 or ROCm 5.1.1
     validations:
       required: false
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/08-general.yml
@@ -125,6 +125,6 @@ body:
     id: ep-version
     attributes:
       label: Execution Provider Library Version
-      placeholder: ex. CUDA 11.6 or ROCm 5.1.1
+      placeholder: ex. CUDA 12.2 or ROCm 5.1.1
     validations:
       required: false
18 changes: 18 additions & 0 deletions .github/actions/install-onnxruntime-wheel/action.yml
@@ -0,0 +1,18 @@
name: 'Install ONNX Runtime Wheel'
description: 'Uninstalls existing ONNX Runtime packages and installs a wheel from a specified directory.'
inputs:
  whl-directory:
    description: 'The directory containing the ONNX Runtime wheel files.'
    required: true
runs:
  using: 'composite'
  steps:
    - name: Uninstall onnxruntime packages
      shell: pwsh
      run: |
        python -m pip uninstall -y onnxruntime onnxruntime-gpu onnxruntime-training onnxruntime-directml -qq

    - name: Install onnxruntime wheel from specified directory
      shell: pwsh
      run: |
        Get-ChildItem -Path ${{ inputs.whl-directory }}/*.whl | foreach {pip --disable-pip-version-check install --upgrade $_.fullname}
71 changes: 71 additions & 0 deletions .github/actions/linux-web-init-and-check/action.yml
@@ -0,0 +1,71 @@
name: "Web Build Initialize and Check on Linux"
description: "Initializes and checks the ONNX Runtime Web build on Linux."
runs:
  using: "composite"
  steps:
    - name: Setup Node.js
      uses: actions/setup-node@v3
      with:
        node-version: "22.x"

    - name: npm ci /js/
      run: npm ci
      shell: bash
      working-directory: ${{ github.workspace }}/js

    - name: npm ci /js/common/
      run: npm ci
      shell: bash
      working-directory: ${{ github.workspace }}/js/common

    - name: run onnxruntime-common tests
      run: npm test
      shell: bash
      working-directory: ${{ github.workspace }}/js/common

    - name: run onnxruntime-common tests (enable Float16Array)
      run: npm run test:f16
      shell: bash
      working-directory: ${{ github.workspace }}/js/common

    - name: npm ci /js/web/
      run: npm ci
      shell: bash
      working-directory: ${{ github.workspace }}/js/web

    - name: run TypeScript type check in /js/web/
      run: npm run prebuild
      shell: bash
      working-directory: ${{ github.workspace }}/js/web

    - name: run ESLint
      run: npm run lint
      shell: bash
      working-directory: ${{ github.workspace }}/js

    - name: Format code
      run: npm run format
      shell: bash
      working-directory: ${{ github.workspace }}/js

    - name: Check unformatted files
      run: |
        node -e "a=require('child_process').execSync('git diff --name-only').toString();if(a)throw new Error('Following source files are not formatted: (did you run \"npm run format\"?)\n'+a)"
      shell: bash
      working-directory: ${{ github.workspace }}/js

    - name: TypeDoc Validation
      run: npx typedoc --emit none --treatWarningsAsErrors
      shell: bash
      working-directory: ${{ github.workspace }}/js/common

    - name: Generating documents
      run: npm run build:doc
      shell: bash
      working-directory: ${{ github.workspace }}/js/web

    - name: Check out of dated documents
      run: |
        node -e "a=require('child_process').execSync('git diff --name-only').toString();if(a)throw new Error('Following documents are not up-to-date: (did you run \"npm run build:doc\"?)\n'+a)"
      shell: bash
      working-directory: ${{ github.workspace }}/js/web
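The two `node -e` one-liners in this action share one pattern: run a generator step (formatter or doc build), then fail if `git diff --name-only` reports any tracked file as modified. A minimal Python sketch of that pattern follows, for illustration only. The helper names `changed_files` and `fail_if_changed` are hypothetical and are not part of this PR.

```python
import subprocess


def changed_files(repo_dir="."):
    """Return the paths reported by `git diff --name-only` in repo_dir."""
    result = subprocess.run(
        ["git", "diff", "--name-only"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return [line for line in result.stdout.splitlines() if line]


def fail_if_changed(files, hint):
    """Raise if a generator step (formatter, doc build) modified tracked files."""
    if files:
        listing = "\n".join(files)
        raise RuntimeError(f"Files changed (did you run '{hint}'?)\n{listing}")


# Typical use inside a checkout, after the formatter has run:
#   fail_if_changed(changed_files("js"), "npm run format")
```

Keeping the git call separate from the check makes the failure logic testable without a repository.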
80 changes: 80 additions & 0 deletions .github/actions/locate-vcvarsall-and-setup-env/action.yml
@@ -0,0 +1,80 @@
name: 'Locate vcvarsall and Setup Environment'
description: 'Locates vcvarsall.bat, sets up the environment, and handles PATH updates.'
inputs:
  architecture:
    description: 'Target architecture (x64 or x86)'
    required: true
    default: 'x64'
outputs:
  vcvarsall_path:
    description: "Path to vcvarsall.bat"
    value: ${{ steps.find-vcvarsall.outputs.vcvarsall_path }}
runs:
  using: "composite"
  steps:

    - name: Setup VCPKG
      uses: microsoft/onnxruntime-github-actions/[email protected]
      with:
        vcpkg-version: '2025.03.19'
        vcpkg-hash: '17e96169cd3f266c4716fcdc1bb728e6a64f103941ece463a2834d50694eba4fb48f30135503fd466402afa139abc847ef630733c442595d1c34979f261b0114'
        cmake-version: '3.31.6'
        cmake-hash: '0f1584e8666cf4a65ec514bd02afe281caabf1d45d2c963f3151c41484f457386aa03273ab25776a670be02725354ce0b46f3a5121857416da37366342a833a0'
        add-cmake-to-path: 'true'
        disable-terrapin: 'false'

    - name: Verify vcpkg setup
      shell: pwsh # Use powershell to easily access env var
      run: |
        Write-Host "VCPKG_INSTALLATION_ROOT is set to: $env:VCPKG_INSTALLATION_ROOT"
        & "$env:VCPKG_INSTALLATION_ROOT/vcpkg" version

    - name: Find vcvarsall.bat
      id: find-vcvarsall
      shell: python # Use Python shell
      run: |
        import os
        import subprocess

        vswhere_path = os.path.join(os.environ["ProgramFiles(x86)"], "Microsoft Visual Studio", "Installer", "vswhere.exe")

        try:
            process = subprocess.run([vswhere_path, "-latest", "-property", "installationPath"], capture_output=True, text=True, check=True)
            vs_install_path = process.stdout.strip()
            vcvarsall_path = os.path.join(vs_install_path, "VC", "Auxiliary", "Build", "vcvarsall.bat")

            if os.path.exists(vcvarsall_path):
                print(f"vcvarsall found at: {vcvarsall_path}")
                # Use GITHUB_OUTPUT environment variable
                with open(os.environ['GITHUB_OUTPUT'], 'a') as f:
                    f.write(f"vcvarsall_path={vcvarsall_path}\n")
            else:
                print(f"vcvarsall.bat not found at expected path: {vcvarsall_path}")
                # Use 'exit(1)' for Python to properly signal failure to GitHub Actions
                exit(1)


        except subprocess.CalledProcessError as e:
            print(f"Error running vswhere.exe: {e}")
            print(f"vswhere output: {e.stdout}")
            print(f"vswhere stderr: {e.stderr}")
            exit(1) # Exit with a non-zero code on error
        except FileNotFoundError:
            print(f"vswhere.exe not found at: {vswhere_path}")
            exit(1)


    - name: Setup Environment
      shell: cmd
      run: |
        REM Get initial environment variables
        set > initial_env.txt

        REM Call vcvarsall.bat using the output from the previous step
        call "${{ steps.find-vcvarsall.outputs.vcvarsall_path }}" ${{ inputs.architecture }}

        REM Get environment variables after calling vcvarsall.bat
        set > final_env.txt

        REM Call the Python script to update the GitHub Actions environment
        python ${{ github.action_path }}\update_environment.py
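The `Find vcvarsall.bat` step publishes its result by appending a `name=value` line to the file named by the `GITHUB_OUTPUT` environment variable, which is how the later step can read `steps.find-vcvarsall.outputs.vcvarsall_path`. A small sketch of that mechanism, runnable outside a workflow, is shown below. The helper `set_step_output` and the demo path are illustrative assumptions, not code from this PR.

```python
import os
import tempfile


def set_step_output(name, value, output_file=None):
    """Append a single-line `name=value` pair to the step-output file."""
    output_file = output_file or os.environ["GITHUB_OUTPUT"]
    if "\n" in value:
        # The plain name=value form only carries single-line values.
        raise ValueError("multi-line values are not supported by this sketch")
    with open(output_file, "a", encoding="utf-8") as f:
        f.write(f"{name}={value}\n")


# Local demonstration: a temporary file stands in for $GITHUB_OUTPUT.
with tempfile.TemporaryDirectory() as tmp:
    demo_file = os.path.join(tmp, "github_output.txt")
    set_step_output("vcvarsall_path", r"C:\VS\VC\Auxiliary\Build\vcvarsall.bat", demo_file)
    with open(demo_file, encoding="utf-8") as f:
        demo_contents = f.read()
print(demo_contents)
```

Opening the file in append mode matters because several steps of the same job may write outputs to it.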
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
@echo off
setlocal

set vswherepath="%ProgramFiles(x86)%\Microsoft Visual Studio\Installer\vswhere.exe"
set vcvarsall_arch=%1
if "%vcvarsall_arch%" == "x86" (
  set vcvarsall_arch=x86
) else (
  set vcvarsall_arch=x64
)

for /f "usebackq delims=" %%i in (`%vswherepath% -latest -property installationPath`) do (
  if exist "%%i\VC\Auxiliary\Build\vcvars%vcvarsall_arch%.bat" (
    set "vcvarsall=%%i\VC\Auxiliary\Build\vcvars%vcvarsall_arch%.bat"
  )
)

echo "Get initial environment variables"
set > initial_env.txt

echo "Call vcvarsall.bat"
call "%vcvarsall%"

echo "Get environment variables after calling vcvarsall.bat"
set > final_env.txt

echo "Call the Python script to update the GitHub Actions environment"
python "%~dp0\update_environment.py"

endlocal
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
import os
import re


def read_env_file(filepath):
    env_vars = {}
    with open(filepath) as f:
        for line in f:
            match = re.match(r"^(.*?)=(.*)$", line.strip())
            if match:
                env_vars[match.group(1).upper()] = match.group(2)
    return env_vars


initial_env = read_env_file("initial_env.txt")
final_env = read_env_file("final_env.txt")

for key, value in final_env.items():
    if key not in initial_env or initial_env[key] != value:
        if key.startswith("_"):
            continue
        if key.upper() == "PATH":
            new_paths = value.split(";")
            initial_paths = initial_env.get("PATH", "").split(";")
            added_paths = [p for p in new_paths if p not in initial_paths and p]

            if added_paths:
                print("Adding paths")
                with open(os.environ["GITHUB_PATH"], "a") as f:
                    for path in added_paths:
                        print(f"Adding PATH: {path}")
                        f.write(path + os.linesep)
        else:
            # Use GITHUB_ENV
            with open(os.environ["GITHUB_ENV"], "a") as f:
                print(f"Setting {key}={value}\n")
                f.write(f"{key}={value}\n")
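The script above snapshots the environment before and after `vcvarsall.bat`, then forwards only the difference: new `PATH` entries go to `GITHUB_PATH`, other new or changed variables go to `GITHUB_ENV`. The sketch below isolates that diffing logic on in-memory strings so it can be tried without a Visual Studio install. `parse_env` and `diff_env` are hypothetical names, and the sample values are made up.

```python
import re


def parse_env(text):
    """Parse `set`-style output (NAME=value per line) into a dict with upper-cased keys."""
    env = {}
    for line in text.splitlines():
        match = re.match(r"^(.*?)=(.*)$", line.strip())
        if match:
            env[match.group(1).upper()] = match.group(2)
    return env


def diff_env(initial, final):
    """Return (added_paths, changed_vars) between two parsed environments."""
    added_paths = []
    changed_vars = {}
    for key, value in final.items():
        # Skip unchanged variables and internal ones whose name starts with "_".
        if initial.get(key) == value or key.startswith("_"):
            continue
        if key == "PATH":
            known = initial.get("PATH", "").split(";")
            added_paths = [p for p in value.split(";") if p and p not in known]
        else:
            changed_vars[key] = value
    return added_paths, changed_vars


before = parse_env("Path=C:\\Windows\nFOO=1\n")
after = parse_env("Path=C:\\VS\\bin;C:\\Windows\nFOO=1\nINCLUDE=C:\\VS\\include\n")
added_paths, changed_vars = diff_env(before, after)
print(added_paths)
print(changed_vars)
```

Upper-casing the keys is what lets `Path` and `PATH` compare equal, since Windows environment variable names are case-insensitive.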
79 changes: 79 additions & 0 deletions .github/actions/macos-ci-setup/action.yml
@@ -0,0 +1,79 @@
name: "macOS CI pipeline setup steps"
description: "Common setup steps for macOS CI pipelines"

inputs:
  platform_machine:
    required: false
    type: string
    default: "arm64"
  python_version:
    required: false
    type: string
    default: "3.11"
  node_version:
    required: false
    type: string
    default: "20.x"
  java_version:
    required: false
    type: string
    default: "17"
  xcode_version:
    required: false
    type: string
    default: "16"
  use_cache:
    required: false
    type: boolean
    default: false

runs:
  using: "composite"
  steps:
    - name: Use Python
      uses: actions/setup-python@v5
      with:
        python-version: ${{ inputs.python_version }}

    - name: Verify machine architecture
      shell: python
      run: |
        import platform
        print(f"Running on {platform.machine()}")
        assert platform.machine().lower() == "${{ inputs.platform_machine}}", "This job expects to be run on an ${{ inputs.platform_machine}} machine."

    - name: Use Node.js
      uses: actions/setup-node@v4
      with:
        node-version: ${{ inputs.node_version }}

    - name: Install coreutils and ninja
      shell: bash
      run: brew install coreutils ninja

    - name: Install Java
      uses: actions/setup-java@v4
      with:
        distribution: "temurin"
        java-version: ${{ inputs.java_version }}

    - name: Use Xcode ${{ inputs.xcode_version }}
      shell: bash
      run: |
        XCODE_DEVELOPER_DIR="/Applications/Xcode_${{ inputs.xcode_version }}.app/Contents/Developer"
        sudo xcode-select --switch "${XCODE_DEVELOPER_DIR}"

    - name: Export GitHub Actions cache environment variables
      if: ${{ inputs.use_cache }}
      uses: actions/github-script@v7
      with:
        script: |
          core.exportVariable('ACTIONS_CACHE_URL', process.env.ACTIONS_CACHE_URL || '');
          core.exportVariable('ACTIONS_RUNTIME_TOKEN', process.env.ACTIONS_RUNTIME_TOKEN || '');

    - name: Install python dependencies
      shell: bash
      working-directory: ${{ github.workspace }}
      run: |
        python -m pip install --upgrade pip
        python -m pip install -r requirements-dev.txt