Conversation

@mnc537 (Contributor) commented Feb 23, 2020

Hello,

This PR allows FasterRCNN to train with negative samples.
Related to: #1598

When defining the dataset, one needs to set the boxes field in the target dict to torch.zeros((0, 4), dtype=torch.float32) for negative images, since boxes is required.

This is how the target should look:

target = {}
target["boxes"] = torch.zeros((0, 4), dtype=torch.float32)
target["labels"] = torch.zeros((1, 1), dtype=torch.int64)
target["image_id"] = image_id
target["area"] = (boxes[:, 3] - boxes[:, 1]) * (boxes[:, 2] - boxes[:, 0])
target["iscrowd"] = torch.zeros((0,), dtype=torch.int64)

@mnc537 mnc537 changed the title Negative faster rcnn Train Faster R-CNN with negative samples Feb 23, 2020
@mnc537 mnc537 requested a review from fmassa February 23, 2020 18:50
@cpuhrsch cpuhrsch self-requested a review February 25, 2020 00:36
@cpuhrsch (Contributor) commented Mar 2, 2020

I'm out of my depth on this particular piece of code and will leave the review to @fmassa.

@cpuhrsch cpuhrsch removed their request for review March 2, 2020 18:25
@fmassa (Member) left a comment

The PR looks great, Monica, thanks a lot!

I have only a few minor comments; could you look into them? Once they are addressed, this will be good to merge.

We should also make sure that Mask R-CNN and Keypoint R-CNN work with empty targets; could you look into that in a follow-up PR?

Once again thanks a lot!

@@ -730,7 +743,8 @@ def forward(self, features, proposals, image_shapes, targets=None):
for t in targets:
# TODO: https://github.com/pytorch/pytorch/issues/26731
floating_point_types = (torch.float, torch.double, torch.half)
assert t["boxes"].dtype in floating_point_types, 'target boxes must of float type'
if t["boxes"] is not None:

nit: this conditional is not needed anymore


gt_boxes_in_image = gt_boxes[img_id]
if gt_boxes_in_image.numel() == 0:
gt_boxes_in_image = torch.zeros((1, 4), dtype=dtype)

Do we need to take the device of gt_boxes_in_image into account?
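
A minimal sketch of what taking the device into account could look like here (assuming dtype is already defined, as in the surrounding code; an empty tensor still carries a device):

device = gt_boxes_in_image.device
gt_boxes_in_image = torch.zeros((1, 4), dtype=dtype, device=device)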

Comment on lines 580 to 583
clamped_matched_idxs_in_image = torch.zeros(
(proposals_in_image.shape[0],), dtype=torch.int64
)
labels_in_image = torch.zeros((proposals_in_image.shape[0],), dtype=torch.int64)

I think it would be safer if we also take the device of proposals_in_image into account while creating those tensors
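
Along the same lines, a device-aware sketch of the snippet above (assuming proposals_in_image is available, as in the surrounding code):

device = proposals_in_image.device
clamped_matched_idxs_in_image = torch.zeros(
    (proposals_in_image.shape[0],), dtype=torch.int64, device=device
)
labels_in_image = torch.zeros(
    (proposals_in_image.shape[0],), dtype=torch.int64, device=device
)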

@codecov-io commented

Codecov Report

❗ No coverage uploaded for pull request base (master@d45a77d).
The diff coverage is 0%.

@@           Coverage Diff            @@
##             master   #1911   +/-   ##
========================================
  Coverage          ?   0.48%           
========================================
  Files             ?      92           
  Lines             ?    7442           
  Branches          ?    1133           
========================================
  Hits              ?      36           
  Misses            ?    7393           
  Partials          ?      13
Impacted Files Coverage Δ
torchvision/models/detection/roi_heads.py 0% <0%> (ø)
torchvision/models/detection/rpn.py 0% <0%> (ø)

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@fmassa (Member) left a comment

Thanks a lot, Monica!

@fmassa fmassa merged commit e75b497 into pytorch:master Mar 20, 2020
@rronan commented Mar 20, 2020

Thanks for this feature!

This is how the target should look:

target = {}
target["boxes"] = torch.zeros((0, 4), dtype=torch.float32)
target["labels"] = torch.zeros((1, 1), dtype=torch.int64)
target["image_id"] = image_id
target["area"] = (boxes[:, 3] - boxes[:, 1]) * (boxes[:, 2] - boxes[:, 0])
target["iscrowd"] = torch.zeros((0,), dtype=torch.int64)

One question though: why does target["labels"] look like this? Shouldn't it be of length 0 (like boxes), or at least have only one dimension (not two), as specified here:
https://github.com/pytorch/vision/blob/master/torchvision/models/detection/faster_rcnn.py#L47

@fmassa (Member) commented Mar 20, 2020

@rronan I agree, I was wondering the same thing. @mnc537 is this a typo in the test?

@mnc537 (Contributor, Author) commented Mar 23, 2020

@rronan, right! It should be of length 0. There is a typo in the test, @fmassa. I'll fix it.
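
For later readers, the corrected target for a negative image would then look like this (a sketch reflecting the fix described above; image_id is whatever identifier your dataset uses):

target = {}
target["boxes"] = torch.zeros((0, 4), dtype=torch.float32)
target["labels"] = torch.zeros((0,), dtype=torch.int64)   # length 0, matching boxes
target["image_id"] = image_id
target["area"] = torch.zeros((0,), dtype=torch.float32)   # empty, since there are no boxes
target["iscrowd"] = torch.zeros((0,), dtype=torch.int64)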

@fmassa (Member) commented Mar 23, 2020

Thanks @mnc537 !

@rronan commented Mar 24, 2020

Thank you @mnc537. I've been using this PR and haven't run into any issues yet.

fmassa added a commit to fmassa/vision-1 that referenced this pull request Jun 9, 2020
* modified FasterRCNN to accept negative samples

* remove debug lines

* Change torch.zeros_like to torch.zeros

* Add unit tests

* take the `device` into account

Co-authored-by: Francisco Massa <[email protected]>
@Kirayue commented Nov 11, 2020

Hi, @fmassa, @mnc537

I have a question about target["labels"].

According to the PR,

target = {}
target["boxes"] = torch.zeros((0, 4), dtype=torch.float32)
target["labels"] = torch.zeros((1, 1), dtype=torch.int64)
target["image_id"] = image_id
target["area"] = (boxes[:, 3] - boxes[:, 1]) * (boxes[:, 2] - boxes[:, 0])
target["iscrowd"] = torch.zeros((0,), dtype=torch.int64)

@mnc537 said there was a typo, so should the target look like

target["labels"] = torch.zeros((1,), dtype=torch.int64), or
target["labels"] = torch.ones((0,), dtype=torch.int64), or
target["labels"] = torch.zeros((1, 1), dtype=torch.int64), or
target["labels"] = torch.zeros((0,), dtype=torch.int64)?

I tried all of the above and none of them raised an error.

Thank you for the feature.

@dccf36 commented Jun 23, 2021

Has anyone encountered the problem of the loss exploding after adding negative (background-only) samples?

@samra-irshad commented

@mnc537 Thanks for this feature. Just wondering, do we need to increase the number of classes once we add the targets for negative samples?

@Kirayue commented Aug 11, 2021

@samra-irshad The number of classes is the number of your object classes + 1 (the extra class is the background, which also covers negative samples, i.e. images containing only background).


https://pytorch.org/vision/stable/models.html#id35
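
Concretely (a minimal sketch; the two object classes are made up for illustration):

import torchvision

# two object classes (e.g. cat, dog) + 1 for the background / negative samples
num_classes = 2 + 1
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(num_classes=num_classes)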

@samra-irshad commented

@Kirayue So the background class (0) and negative samples (images with no objects) should have the same label? Or should I allocate an additional label to images with no objects?

@Kirayue commented Aug 11, 2021

@samra-irshad, they are the same, so you do not need to add an additional label to indicate no objects.

@ashep29 commented Oct 25, 2021

I get the following error when I try to use negative samples for unlabelled images with:

target["boxes"] = torch.zeros((0, 4), dtype=torch.float32)
target["labels"] = torch.zeros((1, 1), dtype=torch.int64)

ValueError: Expected target boxes to be a tensor of shape [N, 4], got torch.Size([0]).

Any ideas on how to address this?
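
For what it's worth, the message says the boxes tensor arrived with shape [0] rather than [0, 4], i.e. it lost its second dimension somewhere before reaching the model; the two shapes can be compared directly:

import torch

torch.zeros(0).shape       # torch.Size([0])    -- triggers this ValueError
torch.zeros((0, 4)).shape  # torch.Size([0, 4]) -- what the model expects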

@jodumagpi commented

@mnc537 what should the target masks look like?
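
Since Mask R-CNN expects masks of shape [N, H, W], a plausible sketch for a negative image is an empty mask tensor matching the image size (height and width here are assumptions, taken from your image):

target["masks"] = torch.zeros((0, height, width), dtype=torch.uint8)  # assumes height/width of the image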

@carsumptive commented

Hello, this is a useful feature that I wish were covered in a bit more detail in the docs, and I'm trying to get it to work. I believe I can implement it with the Detecto library, but I'm running into a couple of issues, so I thought it best to follow up on this thread in case anyone is watching it.

  1. When I pass 0,0,0,0 box values through the dataloader, the degenerate-box check in torchvision's GeneralizedRCNN rejects them, since such boxes have zero width and height. Any word on working around this? I assume that check is there for a reason; this PR just doesn't seem to address it. Any help is greatly appreciated. See the sketch below.
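
For reference, the convention this PR supports is an empty [0, 4] boxes tensor rather than a single all-zero box; a 0,0,0,0 box has zero width and height, which is exactly what the degenerate-box check rejects:

# A degenerate box (zero width and height) -- rejected by GeneralizedRCNN:
boxes = torch.tensor([[0., 0., 0., 0.]])

# An empty boxes tensor -- the negative-sample convention from this PR:
boxes = torch.zeros((0, 4), dtype=torch.float32)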
