Mllama single qpc support added #258

quic-amitraj · 2025-02-03T14:53:46Z

Mllama single qpc support added
Simplified generate inputs for single and dual qpc

QEfficient/base/onnx_transforms.py

QEfficient/transformers/models/mllama/modeling_mllama.py

vbaddi · 2025-02-03T15:49:07Z

QEfficient/transformers/models/mllama/modeling_mllama.py

+        # Out-of-place Scatter new into old
+        # out-of-place is important so the original tensor is not affected,
+        # otherwise leads to same operations in both graphs
+        indices = (torch.arange(bsz),)


Please add brief documentation on why these changes are required for single qpc and how they create the graph.

Sure, will update in final version.
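
For readers following this thread, here is a minimal sketch of what an out-of-place scatter looks like in PyTorch. It is not the code from this PR: the helper name `scatter_out_of_place` and the tensor shapes are hypothetical, and only the `indices = (torch.arange(bsz),)` line is taken from the diff above. It uses `Tensor.index_put` (without the trailing underscore), which returns a new tensor and leaves the original untouched, which is the property the diff comment says matters.

```python
import torch


def scatter_out_of_place(old: torch.Tensor, new: torch.Tensor) -> torch.Tensor:
    """Write `new` into the first rows of `old` without modifying `old`.

    Hypothetical helper for illustration; shapes are assumptions.
    """
    bsz = new.shape[0]
    # Same index construction as in the diff: one index per batch row.
    indices = (torch.arange(bsz),)
    # index_put (no underscore) is the out-of-place variant: it returns
    # a fresh tensor, so `old` keeps its original values.
    return old.index_put(indices, new)


old = torch.zeros(4, 3)  # stand-in for an existing state tensor
new = torch.ones(2, 3)   # stand-in for freshly computed values
updated = scatter_out_of_place(old, new)
```

Using the in-place `index_put_` instead would mutate `old` itself, so any other consumer of that tensor would see the updated values as well.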

vbaddi · 2025-02-03T15:50:02Z

QEfficient/transformers/models/mllama/modeling_mllama.py

-        return outputs
+        return outputs
+
+    def generate_mllama_single(self, processor):


This is only required for the ONNX export, right?

Yes, it is. Because processor output varies from model to model, this function helps obtain the model-specific processor output. I have now also removed the dependency on the processor by creating dummy inputs, and made it generic for single and dual qpcs.

QEfficient/transformers/models/modeling_auto.py

Signed-off-by: Amit Raj <[email protected]>

1. Mllama single qpc support added
2. Simplified generate inputs for single and dual qpc

---------

Signed-off-by: Amit Raj <[email protected]>
Co-authored-by: asmigosw <[email protected]>
Signed-off-by: Amit Raj <[email protected]>

quic-amitraj requested review from ochougul and quic-rishinr as code owners February 3, 2025 14:53

vbaddi requested changes Feb 3, 2025


vbaddi assigned quic-amitraj and quic-rishinr Feb 3, 2025

vbaddi added the model-enablement label Feb 3, 2025

asmigosw and others added 6 commits February 3, 2025 19:17

Single qpc support till export

ad5f976

Signed-off-by: Amit Raj <[email protected]>

Minor changes

fd573d8

Signed-off-by: Amit Raj <[email protected]>

Single qpc support

bcff694

Signed-off-by: Amit Raj <[email protected]>

Minor fixes-1

ddc9bfc

Signed-off-by: Amit Raj <[email protected]>

Generate input restructure

15161ce

Signed-off-by: Amit Raj <[email protected]>

ruff fix

a3271c1

Signed-off-by: Amit Raj <[email protected]>

quic-amitraj force-pushed the mllama_single_qpc branch from 1378b1d to a3271c1 Compare February 3, 2025 19:20

quic-amitraj added 2 commits February 4, 2025 04:43

Generate input fix

eaaa517

Signed-off-by: Amit Raj <[email protected]>

Addressed Comments

67cb5ef

Signed-off-by: Amit Raj <[email protected]>

quic-amitraj force-pushed the mllama_single_qpc branch from d2c879d to 67cb5ef Compare February 4, 2025 05:06

quic-rishinr merged commit 8624eec into mllama_vision Feb 4, 2025
3 checks passed

ochougul deleted the mllama_single_qpc branch February 18, 2025 06:59


4 participants
