fix: Use user provided dtype when we can't infer it from the graph #962

peri044 · 2022-04-05T07:42:03Z

Signed-off-by: Dheeraj Peri [email protected]

Description

Use user provided dtype when we can't infer it from the graph

Fixes #911 (comment)

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes

Signed-off-by: Dheeraj Peri <[email protected]>

github-actions

Code conforms to Python style guidelines

github-actions

Code conforms to C++ style guidelines

github-actions

Code conforms to Python style guidelines

github-actions

Code conforms to C++ style guidelines

narendasan · 2022-04-05T15:35:19Z

py/setup.py

@@ -239,8 +239,6 @@ def run(self):
        libraries=["torchtrt"],
        include_dirs=[
            dir_path + "torch_tensorrt/csrc", dir_path + "torch_tensorrt/include",
-            dir_path + "/../bazel-TRTorch/external/tensorrt/include",


Might as well keep this since if people dont redownload the repo after the namechange its going to break the build

added it back. But it would be nice to clean this up at some point in time.

narendasan · 2022-04-05T15:38:03Z

core/compiler.cpp

@@ -328,7 +328,8 @@ void MapInputsAndDetermineDTypes(
        spec.dtype = nvinfer1::DataType::kFLOAT;
      } else if (spec.dtype_is_user_defined && cfg.partition_info.enabled) {
        if (!est_type_opt) {
-          LOG_INFO("Cannot infer input tensor dtype in graph, unable to verify user input dtype settings");
+          LOG_INFO("Cannot infer input tensor dtype in graph. Using user provided input dtype settings");
+          first_use_type_map[in] = {util::TRTDataTypeToScalarType(cfg.convert_info.inputs.find(in)->second.dtype)};


how does this work with this patch? #902

Your patch works in the case where the estimated type is not same as the user provided type. In this case, you override with user provided dtype.
In this PR. if the user dtype is defined (dtype_is_user_defined=True) and if the est_type_opt is null (which happens for the bitwise OR graph I linked in the PR where inferred type is empty, I use user provided types.

narendasan · 2022-04-05T15:39:05Z

core/ir/Input.cpp

@@ -40,6 +40,13 @@ bool valid_dtype_format_combo(nvinfer1::DataType dtype, nvinfer1::TensorFormat f
        default:
          return false;
      }
+    case nvinfer1::DataType::kBOOL: // Supports Linear (NCHW)
+      switch (format) {


We probably should be more clear to the user why a BOOL / NHWC combo doesnt work

NHWC is only bound to FP32 according to https://docs.nvidia.com/deeplearning/tensorrt/api/c_api/namespacenvinfer1.html#ac3e115b1a2b1e578e8221ef99d27cd45

Yeah, I understand, I mean we need to throw a descriptive error or something

Probably put this at the API level

…r local repo name Signed-off-by: Dheeraj Peri <[email protected]>

github-actions

Code conforms to C++ style guidelines

github-actions

Code conforms to Python style guidelines

Signed-off-by: Dheeraj Peri <[email protected]>

github-actions

There are some changes that do not conform to C++ style guidelines:

diff --git a/workspace/cpp/src/types.cpp b/tmp/changes.txt
index d60f6bd..993a9e5 100644
--- a/workspace/cpp/src/types.cpp
+++ b/tmp/changes.txt
@@ -161,7 +161,7 @@ Input::Input(std::vector<int64_t> shape, DataType dtype, TensorFormat format) {
  this->shape = shape;
  this->dtype = dtype;
  this->format = format;
-  if(dtype == DataType::kBool && format == TensorFormat::kChannelsLast){
+  if (dtype == DataType::kBool && format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  this->input_is_dynamic = false;
@@ -184,7 +184,7 @@ Input::Input(c10::IntArrayRef shape, DataType dtype, TensorFormat format) {
  this->max_shape = torch_tensorrt::core::util::toVec(shape);
  this->shape = torch_tensorrt::core::util::toVec(shape);
  this->dtype = dtype;
-  if(dtype == DataType::kBool && format == TensorFormat::kChannelsLast){
+  if (dtype == DataType::kBool && format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  this->format = format;
@@ -220,7 +220,7 @@ Input::Input(
  this->shape = torch_tensorrt::core::util::toVec(
      torch_tensorrt::core::ir::Input(this->min_shape, this->opt_shape, this->max_shape).input_shape);
  this->dtype = dtype;
-  if(dtype == DataType::kBool && format == TensorFormat::kChannelsLast){
+  if (dtype == DataType::kBool && format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  this->format = format;
@@ -250,7 +250,7 @@ Input::Input(
  this->shape = torch_tensorrt::core::util::toVec(
      torch_tensorrt::core::ir::Input(this->min_shape, this->opt_shape, this->max_shape).input_shape);
  this->dtype = dtype;
-  if(dtype == DataType::kBool && format == TensorFormat::kChannelsLast){
+  if (dtype == DataType::kBool && format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  this->format = format;
@@ -263,7 +263,7 @@ Input::Input(at::Tensor tensor) {
  this->max_shape = tensor.sizes().vec();
  this->shape = tensor.sizes().vec();
  this->dtype = tensor.scalar_type();
-  if(dtype == DataType::kBool && format == TensorFormat::kChannelsLast){
+  if (dtype == DataType::kBool && format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  TORCHTRT_ASSERT(
@@ -282,7 +282,7 @@ Input::Input(at::Tensor tensor) {
/* ==========================================*/

torch_tensorrt::core::ir::Input to_internal_input(Input& i) {
-  if(i.dtype == DataType::kBool && i.format == TensorFormat::kChannelsLast){
+  if (i.dtype == DataType::kBool && i.format == TensorFormat::kChannelsLast) {
    TORCHTRT_THROW_ERROR("Input datatype (Bool) is not currently supported with Tensorformat NHWC (kChannelsLast)");
  }
  return torch_tensorrt::core::ir::Input(
ERROR: Some files do not conform to style guidelines

github-actions

Code conforms to Python style guidelines

Signed-off-by: Dheeraj Peri <[email protected]>

github-actions

Code conforms to C++ style guidelines

github-actions

Code conforms to Python style guidelines

narendasan

LGTM

fix: Use user provided dtype when we can't infer it from the graph

14650d1

Signed-off-by: Dheeraj Peri <[email protected]>

peri044 requested a review from narendasan April 5, 2022 07:42

github-actions bot added component: api [Python] Issues re: Python API component: core Issues re: The core compiler labels Apr 5, 2022

github-actions bot approved these changes Apr 5, 2022

View reviewed changes

narendasan reviewed Apr 5, 2022

View reviewed changes

chore: Add bazel-TRTorch reference for users who haven't changed thei…

aed82a0

…r local repo name Signed-off-by: Dheeraj Peri <[email protected]>

github-actions bot approved these changes Apr 5, 2022

View reviewed changes

chore: Handle valid dtype and tensorformat checks in cpp api level

aa3f57a

Signed-off-by: Dheeraj Peri <[email protected]>

github-actions bot added the component: api [C++] Issues re: C++ API label Apr 8, 2022

github-actions bot requested changes Apr 8, 2022

View reviewed changes

github-actions bot approved these changes Apr 8, 2022

View reviewed changes

chore: Revert previous changes

b677e8d

Signed-off-by: Dheeraj Peri <[email protected]>

github-actions bot approved these changes Apr 8, 2022

View reviewed changes

narendasan approved these changes Apr 9, 2022

View reviewed changes

narendasan merged commit 40f8b44 into master Apr 9, 2022

narendasan deleted the user_dtype branch April 9, 2022 03:35

fix: Use user provided dtype when we can't infer it from the graph #962

fix: Use user provided dtype when we can't infer it from the graph #962

Uh oh!

Conversation

peri044 commented Apr 5, 2022

Description

Type of change

Checklist:

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

narendasan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!