Fix some annotations in transforms v2 for JIT v1 compatibility #7252
Conversation
```diff
@@ -96,25 +96,6 @@ def xfail_jit_python_scalar_arg(name, *, reason=None):
     )


-def xfail_jit_tuple_instead_of_list(name, *, reason=None):
```
We actually observed that our functionals didn't work for tuples, but failed to check whether v1 enforces this. Since we have now aligned the behavior, we can also remove this helper, as it is no longer in use.
```diff
     xfail_jit_python_scalar_arg("shear"),
-    xfail_jit_tuple_instead_of_list("fill"),
-    # TODO: check if this is a regression since it seems that should be supported if `int` is ok
```
We were on the right track 🤦
```diff
@@ -450,21 +430,21 @@ def _full_affine_params(**partial_params):
     ]


-def get_fills(*, num_channels, dtype, vector=True):
+def get_fills(*, num_channels, dtype):
```
We now make sure that we get all possible fill types.
```diff
@@ -12,7 +12,7 @@
 D = TypeVar("D", bound="Datapoint")
 FillType = Union[int, float, Sequence[int], Sequence[float], None]
-FillTypeJIT = Union[int, float, List[float], None]
+FillTypeJIT = Optional[List[float]]
```
Revert this to what we had in v1 ...
```diff
@@ -118,7 +118,7 @@ def resized_crop(
     def pad(
         self,
         padding: Union[int, Sequence[int]],
-        fill: FillTypeJIT = None,
+        fill: Optional[Union[int, float, List[float]]] = None,
```
... but keep it for `F.pad`.
Thanks a lot Philip.

IIUC, you didn't really add new tests to make sure everything is OK; instead you removed some of the `xfail` marks so that the pre-existing tests are actually run? Are we sure they cover all the cases we want to support?
Regardless, I'll approve to unblock so we can merge ASAP and test these new changes against #7159
Yes and no. Yes, I've removed some xfails, but I also expanded the tested parameters. See #7252 (comment). Previously we didn't test single-value lists, or tuples in general, for …
…ty (#7252)
Reviewed By: vmoens
Differential Revision: D44416629
fbshipit-source-id: ab4950cc6c3d313355f29c069838fb96fe9a2dbf
TL;DR: this reverts the changes to the annotations of the `fill` and `padding` parameters that we made for v2, since they turned out not to be compatible with the JIT behavior of v1.

We have three groups of transforms that take the `fill` parameter in v1:

- the AA transforms: vision/torchvision/transforms/autoaugment.py, lines 117 to 118 in f6b5b82
- the affine family: vision/torchvision/transforms/transforms.py, lines 1412 to 1413 in f6b5b82
- `transforms.Pad`: vision/torchvision/transforms/transforms.py, lines 408 to 412 in f6b5b82
In eager mode, v1 and v2 behave the same, and we enforce that in our consistency tests. However, when scripted they behave differently. The transforms themselves don't annotate the `fill` parameter, so we have to look at the functionals:

- vision/torchvision/transforms/autoaugment.py, line 14 in f6b5b82
- vision/torchvision/transforms/functional.py, line 1148 in f6b5b82
- `F.pad`: vision/torchvision/transforms/functional.py, line 495 in f6b5b82
Let's start with the AA and affine family, since they use the same annotation:
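As a minimal sketch of the probe, assuming `F.rotate` as the representative functional and a hand-picked set of `fill` values (both are assumptions, not necessarily the exact setup behind this PR):

```python
import torch
from torchvision.transforms import functional as F_v1
from torchvision.prototype.transforms import functional as F_v2

image = torch.rand(3, 32, 32)
rotate_v1 = torch.jit.script(F_v1.rotate)
rotate_v2 = torch.jit.script(F_v2.rotate)

# Probe different `fill` values and report what script mode accepts.
fills = [None, 1, 1.0, [1, 2, 3], [1.0, 2.0, 3.0], (1, 2, 3), (1.0, 2.0, 3.0)]

for fill in fills:
    results = []
    for name, fn in [("v1", rotate_v1), ("v2", rotate_v2)]:
        try:
            fn(image, angle=30.0, fill=fill)
            results.append(f"{name}: works")
        except Exception as exc:
            results.append(f"{name}: fails ({type(exc).__name__})")
    print(f"fill={fill!r} -> " + ", ".join(results))
```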
On `main` this prints failures for v2 for the lists of integers and, surprisingly, also for the tuples. So how did that happen? In v2 we changed the annotation to `FillTypeJIT = Union[int, float, List[float], None]`:
vision/torchvision/prototype/datapoints/_datapoint.py, line 15 in f6b5b82
Meaning, the failures for the lists of integers are "expected" (we'll get to why later), but what happened to the tuples? v1 didn't annotate them either, so why did they ever work?
This is caused by some (undocumented) automagic of JIT: annotating a parameter with `List[int]` will automatically handle tuple inputs as well. However, if we correct the annotation to `Union[int, List[int]]`, the automagic is no longer applied.
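Here is a self-contained sketch of both behaviors; the function names are made up for illustration:

```python
from typing import List, Union

import torch


@torch.jit.script
def takes_list(sizes: List[int]) -> int:
    return len(sizes)


@torch.jit.script
def takes_union(sizes: Union[int, List[int]]) -> int:
    if isinstance(sizes, int):
        return 1
    return len(sizes)


takes_list([1, 2, 3])  # works
takes_list((1, 2, 3))  # also works: JIT silently converts the tuple into a list

takes_union([1, 2, 3])  # works
try:
    takes_union((1, 2, 3))  # the automagic is not applied to the explicit Union
except RuntimeError as exc:
    print(f"tuple input fails: {exc}")
```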
Well, we could just add `Tuple[int]` to the new annotation, right? Nope. `Tuple[int]` is not the equivalent of `List[int]`; that would be `Tuple[int, ...]`, which is not supported by JIT. And since the length of `fill` corresponds to the number of channels, which is only known at runtime, we cannot use `Tuple[int, int, int]` or any other fixed-length tuple. Meaning, we need to rely on the automagic for BC and have to revert our annotation changes.
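To convince yourself that JIT rejects variadic tuples, a quick sketch (the exact error type raised is not guaranteed):

```python
from typing import Tuple

import torch


def variadic(values: Tuple[int, ...]) -> int:
    return len(values)


# JIT has no concept of variable-length tuples, so scripting itself fails.
try:
    torch.jit.script(variadic)
except Exception as exc:
    print(f"scripting fails: {type(exc).__name__}")
```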
So is this the end of the story? Nope again. As we saw above, `F.pad` uses different annotations. Let's run our script from above against it, leaving out `None` since it does not work while scripting:
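Again a sketch rather than the original script; the probe values are assumptions:

```python
import torch
from torchvision.transforms import functional as F_v1
from torchvision.prototype.transforms import functional as F_v2

image = torch.rand(3, 32, 32)
pad_v1 = torch.jit.script(F_v1.pad)
pad_v2 = torch.jit.script(F_v2.pad)

# `None` is skipped, since it does not work for v1 while scripting (see above).
fills = [1, 1.0, [1.0], (1.0,), [1.0, 2.0, 3.0], (1.0, 2.0, 3.0)]

for fill in fills:
    results = []
    for name, fn in [("v1", pad_v1), ("v2", pad_v2)]:
        try:
            fn(image, padding=[1], fill=fill)
            results.append(f"{name}: works")
        except Exception as exc:
            results.append(f"{name}: fails ({type(exc).__name__})")
    print(f"fill={fill!r} -> " + ", ".join(results))
```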
Everything that v1 accepts still works. Meaning, we can keep our new annotation for `F.pad`, since the v2 variant supports a superset of the values of v1, in eager as well as in scripted mode.

What's left is the `padding` argument of `F.pad`:

- v1: vision/torchvision/transforms/functional.py, line 495 in f6b5b82
- v2: vision/torchvision/prototype/transforms/functional/_geometry.py, line 1134 in f6b5b82
You can probably see where this is going. We change the script from above to iterate over different `padding` values instead:
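A sketch of that variation (the probe values are again assumptions):

```python
import torch
from torchvision.transforms import functional as F_v1
from torchvision.prototype.transforms import functional as F_v2

image = torch.rand(3, 32, 32)
pad_v1 = torch.jit.script(F_v1.pad)
pad_v2 = torch.jit.script(F_v2.pad)

# Probe scalars, lists, and tuples of the lengths that pad accepts.
paddings = [1, [1], (1,), [1, 2], (1, 2), [1, 2, 3, 4], (1, 2, 3, 4)]

for padding in paddings:
    results = []
    for name, fn in [("v1", pad_v1), ("v2", pad_v2)]:
        try:
            fn(image, padding=padding)
            results.append(f"{name}: works")
        except Exception as exc:
            results.append(f"{name}: fails ({type(exc).__name__})")
    print(f"padding={padding!r} -> " + ", ".join(results))
```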
This gives us the same picture as before: the tuples that v1 handles via the automagic fail against the explicit v2 annotation. Meaning, we need to revert the new annotation and rely on the automagic handling.
If this whole story weren't so sad, it probably should have been a blog post rather than a PR description.
cc @vfdev-5 @bjuncek