
Conversation

@catcor01

No description provided.

- Legalize Torch scaled_dot_product_attention into TOSA by adding the necessary patterns
  in TorchToTosa.cpp plus backend type-conversion hooks.
- Introduce a detailed decomposition path for multi-head attention within DecomposeComplexOps.cpp,
  preparing inputs for TOSA lowering.
- Expand the PT1 e2e suite with a dedicated multi-head attention MLIR/Python test and
  drop the corresponding xfails now that the path works (a minimal sketch of such a test
  is included below).

Signed-off-by: Cathal Corbett <[email protected]>
Change-Id: I96c17aefd25b979f1cf6e897d91d5a29f0a2fa85
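
The decomposition path expands `torch.aten.scaled_dot_product_attention` into its constituent ops (Q·Kᵀ, scaling by 1/√d, softmax, then the matmul with V), which the new TorchToTosa patterns can legalize. As a rough illustration of the kind of PT1 e2e coverage this enables, here is a minimal sketch of a multi-head attention test in the usual torch-mlir e2e style; the class/test names, shapes, and hyperparameters are assumptions for illustration only and are not necessarily those added in this PR.

```python
# Minimal sketch of a PT1 e2e test exercising multi-head attention.
# Names, shapes, and hyperparameters are illustrative assumptions,
# not necessarily those used by this PR.
import torch

from torch_mlir_e2e_test.framework import TestUtils
from torch_mlir_e2e_test.registry import register_test_case
from torch_mlir_e2e_test.annotations import annotate_args, export


class MultiHeadAttentionSketchModule(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # 16-dim embeddings split across 4 heads; batch_first keeps the
        # (batch, seq, embed) layout end to end.
        self.mha = torch.nn.MultiheadAttention(
            embed_dim=16, num_heads=4, batch_first=True)

    @export
    @annotate_args([
        None,
        ([2, 8, 16], torch.float32, True),
    ])
    def forward(self, x):
        # Self-attention: query, key, and value are the same tensor.
        out, _ = self.mha(x, x, x, need_weights=False)
        return out


@register_test_case(module_factory=lambda: MultiHeadAttentionSketchModule())
def MultiHeadAttentionSketchModule_basic(module, tu: TestUtils):
    module.forward(tu.rand(2, 8, 16))
```

Running the lowered module against PyTorch eager mode through the e2e framework is what lets the corresponding xfails be dropped once the TOSA path produces matching results.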
@catcor01 force-pushed the multihead_attention branch from 881f6ed to a98526f on November 19, 2025 at 14:54
