-
Notifications
You must be signed in to change notification settings - Fork 364
Closed
Labels
No Activitycomponent: dynamoIssues relating to the `torch.compile` or `torch._dynamo.export` pathsIssues relating to the `torch.compile` or `torch._dynamo.export` pathsfeature requestNew feature or requestNew feature or requestfx
Description
Context
Certain Torch layers, including torch.nn.Embedding
handle inputs, weights, and constant Tensors which are required by Torch to be 64-bit types. This causes issues when attempting to build TRT converters for such engines, since the inputs and constants will then be 64-bit, which is not supported by TRT.
Feature Proposal
When users specify the truncate_long_and_double
flag, it should have an effect in both FX and Dynamo. A similar feature was implemented for Dynamo compile in a broader context in PR #1983. For this feature, if the user specifies the truncate_long_and_double
flag, we should automatically cast the 64-bit constant tensors (weights, constant indices, etc.) to their 32-bit equivalents for use in TRT.
Metadata
Metadata
Assignees
Labels
No Activitycomponent: dynamoIssues relating to the `torch.compile` or `torch._dynamo.export` pathsIssues relating to the `torch.compile` or `torch._dynamo.export` pathsfeature requestNew feature or requestNew feature or requestfx