NotImplementedError: The operator 'aten::_conv_depthwise2d' is not currently implemented for the XPU device.

### 🐛 Describe the bug

```
python benchmarks/dynamo/huggingface.py --accuracy --float16 -d xpu -n10 --inference --only YituTechConvBert --backend=inductor --cold-start-latency --timeout=10800

loading model: 0it [00:01, ?it/s]
xpu  eval  YituTechConvBert
ERROR:common:
Traceback (most recent call last):
  File "/home/sdp/actions-runner/_work/torch-xpu-ops/pytorch/benchmarks/dynamo/common.py", line 2270, in check_accuracy
    new_result = self.run_n_iterations(
  File "/home/sdp/actions-runner/_work/torch-xpu-ops/pytorch/benchmarks/dynamo/common.py", line 1957, in run_n_iterations
    model_iter_fn(mod, inputs, collect_outputs=False)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 658, in _fn
    return fn(*args, **kwargs)
  File "/home/sdp/actions-runner/_work/torch-xpu-ops/pytorch/benchmarks/dynamo/huggingface.py", line 523, in forward_pass
    def forward_pass(self, mod, inputs, collect_outputs=True):
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 850, in _fn
    return fn(*args, **kwargs)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_functorch/aot_autograd.py", line 1207, in forward
    return compiled_fn(full_args)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/runtime_wrappers.py", line 331, in runtime_wrapper
    all_outs = call_func_at_runtime_with_args(
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/utils.py", line 126, in call_func_at_runtime_with_args
    out = normalize_as_list(f(args))
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/runtime_wrappers.py", line 692, in inner_fn
    outs = compiled_fn(args)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/runtime_wrappers.py", line 498, in wrapper
    return compiled_fn(runtime_args)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_inductor/output_code.py", line 561, in __call__
    return self.current_callable(inputs)
  File "/home/sdp/.conda/envs/e2e_ci/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2449, in run
    return model(new_inputs)
  File "/tmp/tmpd7imunxp/hh/chhfzkj7uoru46ifjhp5vzlulb5ejwuxtzv4n5vxcysnzadaehek.py", line 1113, in call
    buf14 = extern_kernels.convolution(reinterpret_tensor(buf4, (1, 768, 1, 512), (0, 1, 0, 768), 0), reinterpret_tensor(arg14_1, (768, 1, 1, 9), (9, 9, 9, 1), 0), stride=(1, 1), padding=(0, 4), dilation=(1, 1), transposed=False, output_padding=(0, 0), groups=768, bias=None)
NotImplementedError: The operator 'aten::_conv_depthwise2d' is not currently implemented for the XPU device. Please open a feature on https://github.com/intel/torch-xpu-ops/issues. You can set the environment variable `PYTORCH_ENABLE_XPU_FALLBACK=1` to use the CPU implementation as a fallback for XPU unimplemented operators. WARNING: this will bring unexpected performance compared with running natively on XPU.
TorchDynamo optimized model failed to run because of following error
fail_to_run

```

### Versions

pytorch: 8497491f38cb80aed0e91de9d0d7c3abd83ebfee
torch-xpu-ops: 87906acd61025551ba4b82ea19d4efdc9ac5e0f2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NotImplementedError: The operator 'aten::_conv_depthwise2d' is not currently implemented for the XPU device. #1576

🐛 Describe the bug

Versions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

NotImplementedError: The operator 'aten::_conv_depthwise2d' is not currently implemented for the XPU device. #1576

Description

🐛 Describe the bug

Versions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions