Silent wrong result on FusionTransposeViewSelfMapping_CUDA #1982

@zasdfgbnm

🐛 Describe the bug

Our codegen silently produces a wrong result on a fusion like:

x = torch.zeros(2, 3)
x.transpose(0, 1) + x.view(3, 2)

i.e.:

t0 = torch.zeros(2, 3)
t1 = t0.view({3, 2})
t2 = t0.transpose(0, 1)
t3 = t1 + t2
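
To make the expected result concrete, here is a minimal pure-Python sketch of what the two operands of t3 read from t0. It assumes t0 is contiguous and row-major, and it uses the stand-in values 0..5 instead of zeros, since an all-zero input would hide any mismatch. The helper names (view_offset, transpose_offset) are hypothetical and only for illustration; this is not how the codegen computes indices.

```python
# Flat offset into t0 read by each operand of t3 = t1 + t2.
# Assumption: t0 has shape (2, 3) and is contiguous, row-major.
IN_ROWS, IN_COLS = 2, 3    # shape of t0
OUT_ROWS, OUT_COLS = 3, 2  # shape of t1, t2 and t3


def view_offset(i, j):
    # t1 = t0.view(3, 2): same flat order, reinterpreted as 3 rows of 2.
    return i * OUT_COLS + j


def transpose_offset(i, j):
    # t2 = t0.transpose(0, 1): t2[i][j] is t0[j][i].
    return j * IN_COLS + i


flat = list(range(IN_ROWS * IN_COLS))  # stand-in data 0..5

t1 = [[flat[view_offset(i, j)] for j in range(OUT_COLS)] for i in range(OUT_ROWS)]
t2 = [[flat[transpose_offset(i, j)] for j in range(OUT_COLS)] for i in range(OUT_ROWS)]
t3 = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(t1, t2)]

# Output positions where the two operands read different elements of t0.
mismatched = [
    (i, j)
    for i in range(OUT_ROWS)
    for j in range(OUT_COLS)
    if view_offset(i, j) != transpose_offset(i, j)
]

print(t1)  # [[0, 1], [2, 3], [4, 5]]
print(t2)  # [[0, 3], [1, 4], [2, 5]]
print(t3)  # [[0, 4], [3, 7], [6, 10]]
```

Only positions (0, 0) and (2, 1) read the same input element through both paths. Everywhere else the view and the transpose map the same output position to different elements of t0, so a kernel that treats the two mappings as interchangeable would be expected to produce a wrong sum.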

We should:

  1. Figure out why our codegen doesn't detect this and report an error
  2. Find a way to detect this in schedulers so the DAG is segmented correctly

See discussion: #1954 (comment)

Versions

TOT devel
