[Feature] Add transforms for randomly converting image to grayscale #299

sourabhd · 2017-10-16T14:06:37Z

A data augmentation transform that randomly converts an image to grayscale (with probability p) is useful for handling datasets that contain a mix of RGB and grayscale images.

Please check my implementation here.
Let me know if this looks like a reasonable addition. I could send a pull request for it.

sourabhd · 2017-10-16T14:07:37Z

@soumith @apaszke @fmassa @alykhantejani

alykhantejani · 2017-10-27T08:18:04Z

Hi @sourabhd,

I've taken a quick look at the code and it seems like you convert an image to grayscale and then back to RGB (repeating the grayscale image 3 times).

I'm not sure when you would want to do this. If your dataset is a mix of RGB and grayscale images, I would think you would either want all grayscale images (single channel) or want to convert the grayscale ones to 3-channel grayscale and mix these with the RGB ones.

A mix of single-channel and 3-channel images wouldn't make sense, as your network needs to know the number of input channels.

sourabhd · 2017-10-27T14:43:02Z

@alykhantejani

A 3-channel image could be grayscale if R == G == B
Example image from ImageNet
Grayscale in RGB
Origins

The dataset might contain such 3-channel images, like the example shown above.
If the dataset has single-channel images, they need to be converted to 3 channels, as pre-trained models are available only for 3-channel input. This is usually done by replicating the image across channels, which leads to the same situation.
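
To illustrate the replication point, here is a minimal sketch using PIL (the tiny 2x2 image is made up for illustration and is not from any dataset). It shows that converting a single-channel image to RGB yields three identical channels, i.e. R == G == B:

```python
from PIL import Image

# Hypothetical 2x2 single-channel ("L" mode) image, for illustration only.
pixels = bytes([0, 64, 128, 255])
gray = Image.frombytes("L", (2, 2), pixels)

# Converting "L" to "RGB" copies the gray value into every channel.
rgb = gray.convert("RGB")
r, g, b = rgb.split()

# All three channels carry the same bytes as the original gray image.
channels_equal = r.tobytes() == g.tobytes() == b.tobytes() == pixels
print(rgb.mode, channels_equal)
```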

Why is it needed?
Labeling a dataset can be expensive (for example FACS, emotions, etc.), and we want to make use of both the colored and the grayscale images instead of throwing away one set. To introduce invariance to grayscaling of an image, we could employ an augmentation that grayscales an image randomly (with probability p). The idea is that over multiple epochs the network sees an image as well as its grayscale counterpart (which has the same label) and learns the invariance during backprop accordingly.
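
Roughly, the augmentation could look like the sketch below. This is not the linked implementation; the class name `RandomGrayscaleSketch` and the default `p` are assumptions made for illustration, and it assumes PIL images as input. With probability p the image is converted to grayscale and then back to 3 channels, so the output shape stays the same.

```python
import random

from PIL import Image


class RandomGrayscaleSketch:
    """Hypothetical transform: with probability p, return the 3-channel
    grayscale version of an image (R == G == B); otherwise return the
    image unchanged."""

    def __init__(self, p=0.1):
        if not 0.0 <= p <= 1.0:
            raise ValueError("p must be in [0, 1]")
        self.p = p

    def __call__(self, img):
        # random.random() is in [0, 1), so p=0 never fires and p=1 always does.
        if random.random() < self.p:
            # "L" computes the luminance; going back to "RGB" keeps 3 channels.
            return img.convert("L").convert("RGB")
        return img


# Usage on a made-up solid-color image.
img = Image.new("RGB", (4, 4), (200, 30, 90))
always_gray = RandomGrayscaleSketch(p=1.0)(img)
never_gray = RandomGrayscaleSketch(p=0.0)(img)
```

Because the label is unchanged, such a transform can sit anywhere in the augmentation pipeline before the image is converted to a tensor.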

alykhantejani · 2017-11-03T19:59:07Z

So I think that this could go both ways, i.e. the user wants to change 3-channel images to single-channel grayscale, or the user wants to change 1-channel images to 3-channel ones. I think a to_grayscale function can be misleading in this case, as sometimes it returns 1-channel images and sometimes 3-channel.

Additionally, depending on the user's preference, it should be easy for them to encapsulate their desired behavior in a function (using PIL's convert/stack) and chain these together in a Compose?

@fmassa wdyt?

sourabhd · 2017-11-04T16:23:43Z

@alykhantejani In that case we could have two functions: to_grayscale_singlechannel and to_grayscale_threechannel.

alykhantejani · 2017-11-06T16:04:58Z

@sourabhd yeah, I'd be happy with a to_grayscale function/transform with a num_output_channels kwarg. Can you send a PR?
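
Something along these lines is what I have in mind. It is only a rough sketch assuming PIL images; the name `to_grayscale_sketch` and the error handling are illustrative and not necessarily what a PR would end up with:

```python
from PIL import Image


def to_grayscale_sketch(img, num_output_channels=1):
    """Hypothetical helper: convert a PIL image to grayscale with either
    1 output channel ("L" mode) or 3 identical channels ("RGB" mode)."""
    if num_output_channels == 1:
        return img.convert("L")
    if num_output_channels == 3:
        gray = img.convert("L")
        # Replicate the single gray channel so that R == G == B.
        return Image.merge("RGB", (gray, gray, gray))
    raise ValueError("num_output_channels must be 1 or 3")


# Usage on a made-up solid-color image.
color = Image.new("RGB", (4, 4), (10, 120, 240))
one_channel = to_grayscale_sketch(color, num_output_channels=1)
three_channel = to_grayscale_sketch(color, num_output_channels=3)
```

A single kwarg keeps the return type explicit at the call site, which addresses the concern about a function that sometimes returns 1-channel and sometimes 3-channel images.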

alykhantejani · 2017-11-13T16:59:02Z

Fixed via #325

sourabhd changed the title ~~Add transforms for randomly converting image to grayscale~~ [Feature] Add transforms for randomly converting image to grayscale Oct 18, 2017

sourabhd mentioned this issue Nov 7, 2017

transforms: randomly grayscaling an image #325

Merged

alykhantejani closed this as completed Nov 13, 2017
