Improve transforms #466

Oktai15 · 2018-04-07T10:15:56Z

Hi there!

At the moment, torchvision is quite limited, particularly its transforms. This really restricts our options for data augmentation. However, people have already built some nice libraries for this. For instance:
https://github.com/mdbloice/Augmentor (already has compatibility with PyTorch)
https://github.com/aleju/imgaug

What about contacting the authors and merging this great work into torchvision.transforms?


fmassa · 2018-04-07T10:22:59Z

Hi,

Thanks for the feedback!

I'm in the process of adding new functionality to torchvision, to extend it to work with other data types (like bounding boxes).
I'll have a look at the links you provided.

Oktai15 · 2018-04-07T10:39:27Z

@fmassa, okay, thanks, that will be cool :) Actually, I strongly recommend looking at these libraries instead of writing new code, but, of course, you know better :)

By the way, don't forget about segmentation!

fmassa · 2018-04-07T11:52:27Z

Don't worry, I won't forget about instance segmentation :-)

Oktai15 · 2018-04-07T14:30:08Z

@fmassa sorry for going off-topic: I would like to point out that the first library I linked uses an excellent way to transform two or more images. All of its transforms take a list of images rather than a single image, as transforms in torchvision do. For example:

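def perform_operation(self, images):
    # init random values once, so every image gets the same transform
    augmented_images = []
    for image in images:
        # do() stands for the actual operation applied with those values
        augmented_images.append(do(image))
    return augmented_images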

I suppose it's a great idea ^^
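
To make the pattern above concrete, here is a minimal sketch of a list-based transform. The names `hflip` and `RandomHorizontalFlipList` are hypothetical (they are not Augmentor or torchvision API), and images are plain nested lists so the example stays self-contained. The key point is that the random decision is drawn once and then shared by every image in the list:

```python
import random


def hflip(image):
    """Mirror a 2D image (a list of rows) left to right."""
    return [row[::-1] for row in image]


class RandomHorizontalFlipList:
    """Hypothetical list-based transform: one random draw shared by all images."""

    def __init__(self, p=0.5, rng=None):
        self.p = p
        self.rng = rng or random.Random()

    def __call__(self, images):
        # Decide once, then apply the same decision to every image.
        flip = self.rng.random() < self.p
        return [hflip(img) if flip else img for img in images]
```

With this shape, an image and its segmentation mask can be passed together as `transform([image, mask])` and are guaranteed to stay aligned.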

fmassa · 2018-04-07T15:09:07Z

It's indeed much simpler in some cases, but also more restrictive in more general setups.

For the record, this is something that has been bugging me for a long time, see #9 and #230 for some context.

In many cases, you don't want to apply every transform to all of the data (no color augmentation for segmentation masks, for example), and the approach you mentioned doesn't easily allow that (without making the API overly complex).
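
As a rough illustration of that concern, the sketch below writes the pairing logic by hand: the geometric step is shared between image and mask, while the color step touches only the image. All names here (`hflip`, `adjust_brightness`, `joint_transform`) are hypothetical helpers on plain nested lists, not torchvision functions, and the 0.5 flip probability and 0.8 to 1.2 brightness range are arbitrary assumptions:

```python
import random


def hflip(image):
    """Mirror a 2D image (a list of rows) left to right."""
    return [row[::-1] for row in image]


def adjust_brightness(image, factor):
    """Scale pixel values by factor, clamping at 255."""
    return [[min(255, int(round(px * factor))) for px in row] for row in image]


def joint_transform(image, mask, rng):
    """Flip image and mask together; change brightness of the image only."""
    # Geometric transform: the same random decision applies to both inputs.
    if rng.random() < 0.5:
        image, mask = hflip(image), hflip(mask)
    # Color transform: applied to the image, never to the label mask.
    image = adjust_brightness(image, rng.uniform(0.8, 1.2))
    return image, mask
```

A list-based API would need some extra mechanism (per-input flags, type tags, or similar) to express the "image only" step, which is the added complexity mentioned above.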

nnop · 2019-07-04T09:28:53Z

The imgaug code is somehow far more complicated.

fmassa · 2019-07-04T12:58:46Z

@nnop yes, I saw imgaug and agree that it is very complex. I would rather have a simpler API, even if it means writing a bit more code.

Oktai15 closed this as completed Apr 7, 2018

fmassa added the module: transforms label Jul 4, 2019
