🐛 [Bug] Compilation Error on GPT-2 #1455

Closed
@gs-olive

Bug Description

When converting the GPT-2 network (https://huggingface.co/gpt2) from TorchScript to Torch-TRT, the following error is encountered:

compiled_cpp_mod = _C.compile_graph(module._c, _parse_compile_spec(spec))
RuntimeError: [Error thrown at core/partitioning/shape_analysis.cpp:167] Unsupported input data type unsigned char

To Reproduce

Steps to reproduce the behavior:

  1. Run torch_tensorrt.compile with the GPT-2 model as input, using fp32 precision.
  2. Choose a fixed input size of [1, 128] and enable truncate_long_and_double with a 12 GB workspace.
  3. Pass in model keyword args to disable attention and hidden state outputs (see the sketch below).
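A minimal reproduction sketch along these lines is shown below; the use of GPT2LMHeadModel from HuggingFace transformers, the specific from_pretrained keyword arguments, and the random token input are assumptions made for illustration:

import torch
import torch_tensorrt
from transformers import GPT2LMHeadModel

# Disable attention/hidden-state outputs and caching so tracing returns plain tensors
model = GPT2LMHeadModel.from_pretrained(
    "gpt2",
    torchscript=True,
    use_cache=False,
    output_attentions=False,
    output_hidden_states=False,
).eval()

# Fixed-size token input of shape [1, 128]
input_ids = torch.randint(0, 50257, (1, 128), dtype=torch.int64)
traced = torch.jit.trace(model, input_ids)

trt_model = torch_tensorrt.compile(
    traced,
    inputs=[torch_tensorrt.Input(shape=[1, 128], dtype=torch.int32)],
    enabled_precisions={torch.float32},
    truncate_long_and_double=True,
    workspace_size=12 << 30,  # 12 GB
)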

Expected behavior

The model should compile successfully to Torch-TRT. In particular, internal (non-user-provided) type-casting issues should not cause errors.

Environment

  • Torch-TensorRT Version: 1.3.0a0+e3b99294
  • PyTorch Version: 1.13.0.dev20220921+cu116
  • CPU Architecture: Intel Xeon CPU
  • OS: Ubuntu 20.04
  • How you installed PyTorch: pip
  • Build command you used: python setup.py develop
  • Are you using local sources or building from archives: local
  • Python version: 3.8.13
  • CUDA version: 11.6

Additional context

The problematic data in GPT-2 seems to be this bias term, instantiated in the attention module, which has type uint8. In both the TorchScript IR and the model code (example 1, example 2), it seems that this bias term is generally cast to a bool. The error is thrown in this code segment:

c10::optional<nvinfer1::DataType> dtype = util::optTypeMetaToTRTDataType(cur_ivalue.toTensor().dtype());
if (dtype == c10::nullopt) {
  TORCHTRT_THROW_ERROR("Unsupported input data type " << cur_ivalue.toTensor().dtype());
}
The conversion of a uint8 type to a TRT DataType fails; however, simply patching this conversion does not fix the issue either, as an out-of-bounds error follows later.
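For reference, the buffer can be inspected directly. This is a minimal sketch assuming the HuggingFace transformers GPT2Model; the dtype of this buffer depends on the transformers release:

from transformers import GPT2Model

model = GPT2Model.from_pretrained("gpt2")
# Causal-mask buffer registered in the attention module; uint8 in the
# transformers version used here (later releases register it as bool)
bias = model.h[0].attn.bias
print(bias.dtype, bias.shape)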

Temporary Solution

A temporary fix to this problem is to add the following to the compilation arguments in torch_tensorrt.compile:

torch_tensorrt.compile(..., torch_executed_ops=["aten::where"], ...)

This workaround works because it happens to exclude the code which uses and processes the uint8 tensor; however, it is only a temporary fix and does not resolve the underlying issue.
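Applied to the reproduction sketch above, the workaround would look like the following (the argument values are the assumed ones from the reproduction steps):

trt_model = torch_tensorrt.compile(
    traced,
    inputs=[torch_tensorrt.Input(shape=[1, 128], dtype=torch.int32)],
    enabled_precisions={torch.float32},
    truncate_long_and_double=True,
    workspace_size=12 << 30,
    # Run aten::where in PyTorch so the uint8 mask never reaches TensorRT
    torch_executed_ops=["aten::where"],
)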

Steps to a Solution

  • Fix mismatched dimension issue in aten::where
  • Make at::kByte a valid input type
