
Add convert_to_tensor to tensorflow #11292


Merged: 4 commits merged into python:main on Jan 26, 2024

Conversation

hoel-bagard (Contributor)

Add the convert_to_tensor TensorFlow function.

The actual return type of the function is Union[EagerTensor, SymbolicTensor] (cf. the TensorFlow function), but those are not present in the stubs yet; should they be added?
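As a hedged illustration of the question being asked (the class names mirror TensorFlow's, but the toy implementation below is invented purely to make the relationship concrete), the choice is between exposing the two concrete subclasses in the annotation or collapsing them into the Tensor base class they share:

```python
# Hypothetical sketch, NOT TensorFlow's real implementation: at runtime
# tf.convert_to_tensor returns either an EagerTensor or a SymbolicTensor,
# both of which are Tensors, so a stub can annotate just the base class.
class Tensor: ...
class EagerTensor(Tensor): ...
class SymbolicTensor(Tensor): ...

def convert_to_tensor(value: object) -> Tensor:
    # The stub's annotation only promises the base class; either concrete
    # subclass satisfies it. (Toy logic for illustration only.)
    return value if isinstance(value, Tensor) else EagerTensor()
```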

I would like to help complete the TensorFlow stubs, so I would be grateful if you could tell me anything I need to know in order to contribute.


@hmc-cs-mdrissi (Contributor) left a comment

One small comment. Otherwise, if you'd like to add tf stubs, I'm happy to review, with one caveat: it may help if I give you the remaining tf stubs I have.

The current typeshed tf stubs are a subset of the ones I use, but I haven't spent much time in open source recently, and each time I add stubs to typeshed I also do a cleanup pass, so some discrepancies have accumulated. Still, the stubs I have contain many more files and missing classes, and can be a good starting point.

@hmc-cs-mdrissi (Contributor)

I've placed all of the TensorFlow internal stubs I have in this public repository: https://github.com/hmc-cs-mdrissi/tensorflow_stubs. I'd be happy to see them incorporated into typeshed, and some of the classes you want, like Model, do have stubs there.

These stubs have diverged a little from the final typeshed ones, but they are very similar.


According to mypy_primer, this change has no effect on the checked open source code. 🤖🎉

@hoel-bagard (Contributor, Author)

hoel-bagard commented Jan 26, 2024

@hmc-cs-mdrissi Thank you for the review, and thank you for sharing your stubs!

I don't have a huge codebase to test against, so the stubs I made are incomplete and too restrictive. I'll start by completing/fixing what I already have here using your stubs, if that's OK with you. Or if there's a way I can help get your stubs incorporated into typeshed, please let me know!

On a bit of an unrelated note, I have two questions about how you're using your stubs (or the ones in typeshed):

  • When using the shape of a tensor, TensorShape's __getitem__ and __iter__ return int | None, even though in practice the return type is almost always int. I find that checking the type of each dimension is quite verbose and annoying. Do you have a good way to go about it?
  • When creating custom Layers or Models in my codebase, I would like to be able to annotate them with [_InputT, _OutputT]. However, this is not possible in the actual Python code, so I resorted to creating a stub file next to each of my layer/model definition files. But this doubles the number of files and seems like it will get hard to maintain. Is there a better way to do it?

I just had to switch from PyTorch to TensorFlow, so any advice on how to type a TensorFlow codebase would be extremely welcome.

@hmc-cs-mdrissi
Copy link
Contributor

When using the shape of a tensor, TensorShape's __getitem__ and __iter__ return int | None, even though in practice the return type is almost always int. I find that checking the type of each dimension is quite verbose and annoying. Do you have a good way to go about it?

Concrete tensors always return int. Symbolic tensors may return None for dynamic dimensions like the batch size. My experience is that if you are writing a layer, the first dimension (batch size) can commonly be None, while the other dimensions are rarely None. In practice you normally know whether you have a symbolic or a concrete tensor, but I don't see an easy way in the type system to distinguish the two, especially as which one you get can vary with the TensorFlow execution mode (eager vs. graph vs. TF1). So I normally just add an assert ... is not None as needed. In PyTorch, tensors are almost always concrete, and if you stick to TF2 eager execution you may not notice the Nones much.
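That narrowing pattern can be sketched without TensorFlow installed; the helper name and the example shape below are invented for illustration, with a plain tuple standing in for a TensorShape:

```python
from typing import Optional

def static_dim(dim: Optional[int]) -> int:
    # Narrow a shape entry from int | None to int; fails loudly on a
    # dynamic (None) dimension such as a symbolic batch size.
    assert dim is not None, "dynamic dimension (e.g. symbolic batch size)"
    return dim

# A symbolic-style shape: dynamic batch dimension, static spatial dims.
shape = (None, 224, 224, 3)
spatial = [static_dim(d) for d in shape[1:]]
print(spatial)  # [224, 224, 3]
```

Wrapping the assert in one small helper keeps the narrowing in a single place instead of scattering type checks across the codebase.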

When creating custom Layers or Models in my codebase, I would like to be able to annotate them with [_InputT, _OutputT]. However, this is not possible in the actual Python code, so I resorted to creating a stub file next to each of my layer/model definition files. But this doubles the number of files and seems like it will get hard to maintain. Is there a better way to do it?

Ideally the runtime classes would become generic. While that's not the case, the way I handle it for layers/models is like this:

# Imports assumed; the original snippet left the origin of _KerasLayer
# and Model implicit.
from typing import TYPE_CHECKING, Generic, TypeVar

import tensorflow as tf
from tensorflow.keras import Model
from tensorflow.keras.layers import Layer as _KerasLayer

InputT = TypeVar("InputT")
OutputT = TypeVar("OutputT")

if TYPE_CHECKING:
    # Under the type checker, the stubs already make these classes
    # generic, so plain aliases are enough.
    _KerasLayerBase = _KerasLayer
    _ModelBase = Model
else:
    # At runtime the real classes are not generic, so create
    # subscriptable subclasses instead.
    class _KerasLayerBase(_KerasLayer, Generic[InputT, OutputT]):
        ...

    class _ModelBase(Model, Generic[InputT, OutputT]):
        ...


class KerasLayerGeneric(_KerasLayerBase[InputT, OutputT], Generic[InputT, OutputT]):
    ...


class KerasModelGeneric(_ModelBase[InputT, OutputT], Generic[InputT, OutputT]):
    ...


class KerasLayer(KerasLayerGeneric[tf.Tensor, tf.Tensor]):
    ...


class KerasModel(KerasModelGeneric[tf.Tensor, tf.Tensor]):
    ...

and then I always use these types at runtime.
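The same TYPE_CHECKING trick can be demonstrated without TensorFlow installed; here Base is a hypothetical stand-in for a runtime class (like Layer) that the stubs would make generic but the runtime does not:

```python
from typing import TYPE_CHECKING, Generic, TypeVar

InputT = TypeVar("InputT")
OutputT = TypeVar("OutputT")

# Hypothetical stand-in for a third-party class that is not generic at
# runtime (plays the role of tf.keras.layers.Layer in the pattern above).
class Base:
    def call(self, x):
        return x

if TYPE_CHECKING:
    # Under the type checker we assume stubs make Base generic already,
    # so a plain alias suffices.
    _GenericBase = Base
else:
    # At runtime, build a subscriptable subclass instead.
    class _GenericBase(Base, Generic[InputT, OutputT]):
        pass

class IntIdentity(_GenericBase[int, int]):
    pass

layer = IntIdentity()
print(layer.call(3))
print(isinstance(layer, Base))
```

Because the Generic mixin exists only at runtime and the alias only under the checker, the subclass is both subscriptable in real code and correctly typed in stubs.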

I don't have a huge codebase to test against, so the stubs I made are incomplete and too restrictive. I'll start by completing/fixing what I already have in #11306 using your stubs, if that's OK with you. Or if there's a way I can help get your stubs incorporated into typeshed, please let me know!

My recommendation is to do it a few files at a time and keep the PR size to a couple hundred lines (at most 1k) for reviewability. You can pick any files in my stubs and prepare a PR for them here. The main steps I've followed for adding to typeshed:

  • Pick some files to work on.
  • Work through stubtest/typeshed's CI. While my stubs are used with a medium-sized codebase, stubtest and primer often detect interesting issues to fix. One common issue is that the stubs were written mostly for tf 2.8/2.9, while typeshed stubs target a newer tf, so stubtest may detect missing new parameters.
  • Work through PR review comments, so far from the typeshed maintainers; I'd be happy to review any tensorflow PRs.

@JelleZijlstra JelleZijlstra merged commit 4fa759f into python:main Jan 26, 2024
@hoel-bagard (Contributor, Author)

Thanks for sharing the snippet to make layers/models into generics, that worked really nicely!

My recommendation is to do it a few files at a time and keep the PR size to a couple hundred lines (at most 1k) for reviewability. You can pick any files in my stubs and prepare a PR for them here. The main steps I've followed for adding to typeshed:

  • Pick some files to work on.
  • Work through stubtest/typeshed's CI. While my stubs are used with a medium-sized codebase, stubtest and primer often detect interesting issues to fix. One common issue is that the stubs were written mostly for tf 2.8/2.9, while typeshed stubs target a newer tf, so stubtest may detect missing new parameters.
  • Work through PR review comments, so far from the typeshed maintainers; I'd be happy to review any tensorflow PRs.

I'll do that, thanks.
