Extend optimize_for_ort to cover passes #2274
Conversation
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@ Coverage Diff @@
##             main    #2274      +/-   ##
==========================================
- Coverage   73.78%   73.77%   -0.01%
==========================================
  Files         235      235
  Lines       30936    30939       +3
  Branches     3494     3494
==========================================
  Hits        22825    22825
- Misses       6911     6914       +3
  Partials     1200     1200

☔ View full report in Codecov by Sentry.
Please also consider whether this method should optimize in-place or not. I think we can make it in-place now that shape inference itself is in-place.
# Apply the ORT pattern rewrite rules.
rewrite(model, ORT_PATTERN_REWRITE_RULES)

# TODO(exporter team): Fold transpose into initializers
# Apply the ORT optimization passes.
# https://github.com/microsoft/onnxruntime/blob/74dcf7e296639095dfa55d31336998b6f719ed76/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L172
common_passes.ClearMetadataAndDocStringPass()(model)
You may put all the passes into a pass manager like we do in optimize()
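The pass-manager suggestion above can be sketched generically. This is a minimal illustration of the pattern only; the `PassManager` class, `Model` stand-in, and `clear_metadata` pass here are hypothetical names for this sketch, not the actual onnx_ir or onnxscript API:

```python
from dataclasses import dataclass, field


@dataclass
class Model:
    """Toy stand-in for an IR model; real passes operate on the ONNX IR."""

    metadata: dict = field(default_factory=dict)


class PassManager:
    """Runs a fixed sequence of passes over a model, in order."""

    def __init__(self, passes):
        self.passes = list(passes)

    def __call__(self, model):
        for p in self.passes:
            # In-place passes may return None; keep the same model then.
            model = p(model) or model
        return model


def clear_metadata(model):
    # Hypothetical analogue of ClearMetadataAndDocStringPass:
    # strips metadata in place.
    model.metadata.clear()


manager = PassManager([clear_metadata])
model = manager(Model(metadata={"doc_string": "hello"}))
print(model.metadata)  # {}
```

Grouping the passes in one manager keeps the pipeline declarative: the ordered list documents exactly which transformations run, the same shape `optimize()` already uses.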
Done
I think making it out-of-place is safer, in case we have passes in the future that need to be functional?
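The in-place vs. out-of-place (functional) trade-off being discussed can be illustrated with a toy model. All names below are hypothetical stand-ins for this sketch; real passes operate on the ONNX IR:

```python
import copy
from dataclasses import dataclass, field


@dataclass
class Model:
    """Toy stand-in for an IR model."""

    nodes: list = field(default_factory=list)


def remove_identity_in_place(model):
    """In-place pass: mutates the caller's model and returns it."""
    model.nodes = [n for n in model.nodes if n != "Identity"]
    return model


def remove_identity_functional(model):
    """Functional pass: returns a new model; the input is left untouched."""
    new_model = copy.deepcopy(model)
    new_model.nodes = [n for n in new_model.nodes if n != "Identity"]
    return new_model


original = Model(nodes=["MatMul", "Identity", "Add"])
result = remove_identity_functional(original)
print(original.nodes)  # ['MatMul', 'Identity', 'Add']
print(result.nodes)    # ['MatMul', 'Add']
```

A functional signature keeps the caller's model intact, which matters if a future pass cannot safely mutate in place; the cost is the copy.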
# https://github.com/microsoft/onnxruntime/blob/74dcf7e296639095dfa55d31336998b6f719ed76/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L172
common_passes.ClearMetadataAndDocStringPass(),
# https://github.com/microsoft/onnxruntime/blob/74dcf7e296639095dfa55d31336998b6f719ed76/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L139
common_passes.LiftConstantsToInitializersPass(lift_all_constants=False, size_limit=1),
We have another pass called LiftSubgraphInitializersToMainGraphPass. Do we know if it's needed in genAI? @kunal-vaishnavi
If the pass logic is in DynamoOnnxHelper, then it is used for ONNX Runtime GenAI.
We don't really produce graphs with subgraph initializers. I think we are OK either way.
Fix #2261

A draft for discussion. We should cover all the post-processing that shipping the model needs.