Faster R-CNN (WIP) #21

fmassa · 2016-12-29T17:51:43Z

This WIP PR implements Faster R-CNN based on https://github.com/rbgirshick/py-faster-rcnn .
There are lots of copy-paste from Girshick's repo, and a few files were copied as is.
I decided to fuse a number of operations together, but I'm not sure anymore it's the best way to go.

A few comments:

There are two main classes: FasterRCNN and RPN. RPN implements the region proposal network, and faster r-cnn wraps almost all the detection logic (which might hurt flexibility at some point).
Both classes behave differently if a ground-truth is provided, avoiding to have to write different network definitions for train/test.
I tried to avoid a global cfg dict containing all the parameters, and instead pass the parameters as the constructors of the classes. But different train/test parameters still need to be defined in the model.
There is currently no handling on the image scaling (originally present in im_info[2]), necessary for properly pruning small boxes in the RPN.
I'll add optimized implementations for ROIPooling/nms/etc later on using cffi.

I'm opening this PR to get some feed-back on the general structure of the code.
Overall, I'm starting to think that it might be better to use the same structure as the one in py-faster-rcnn from Girshick repo.

Code cleanup and organization is required.

No bbox regression

Need to test for correctness

colesbury · 2017-01-06T00:07:19Z

This looks good. I think the most important thing is to get training (and evaluation) working and then worry more about design and cleaning up the code.

Some thoughts:

I think copying from Ross's Faster-RCNN repo makes sense where appropriate. Keeping them in separate files, unchanged if possible, would be best
There's a trade-off between making everything part of the model (nn.Container) and keeping it as part of the training function. For example, FasterRCNN.forward is a bit complicated because it combines training and evaluation logic: it has two possible return types and two possible input types.
The forward() method in modules should take in Variables where appropriate; the wrapping of tensors in Variables should happen outside the module
Try to match PyTorch style (PEP 8) for stuff that's not from RBG's repo. (4 space indent, etc.)
Use Python argparse to configure options. I like how you're not passing a global cfg around. If you want to support a config file as well, I think you can do so with ConfigParser

apaszke

Don't we have to include the license file if we're including the original code?

fast_rcnn/voc.py

+  boxes[:, 2] = width - oldx1 - 1
+  return boxes
+
+class TransformVOCDetectionAnnotation(object):


fast_rcnn/utils.py

+    if isinstance(x, np.ndarray):
+        return Variable(torch.from_numpy(x), requires_grad=False)
+    elif torch.is_tensor(x):
+        return Variable(x, requires_grad=True)


fast_rcnn/faster_rcnn.py

+    cls_crit = nn.CrossEntropyLoss()
+    cls_loss = cls_crit(scores, labels)
+
+    reg_crit = nn.SmoothL1Loss()


fast_rcnn/roi_pooling.py

+from torch.autograd.function import Function
+from torch._thnn import type2backend
+
+class AdaptiveMaxPool2d(Function):


fast_rcnn/rpn.py

+
+  # I need to know the original image size (or have the scaling factor)
+  def get_roi_boxes(self, anchors, rpn_map, rpn_bbox_deltas, im):
+    # TODO fix this!!!


fast_rcnn/rpn.py

+  class_to_ind = dict(zip(cls, range(len(cls))))
+
+
+  train = VOCDetection('/home/francisco/work/datasets/VOCdevkit/', 'train',


apaszke · 2017-01-06T23:26:13Z

Also, there are a lot of commented statements that should be removed before merging

fmassa · 2017-01-06T23:53:39Z

@colesbury @apaszke thanks for your comments. I think the ConfigParser might be a nice way of addressing tons of arguments.

I initially wanted to keep the number of files small (as it is an example code), so I fused a number of things together, but that's probably a poor design choice.

I will validate that the basic code is working as expected by performing a training/evaluation and then I'll focus on getting a refactoring of this PR.

I'll get back to it on Monday, I've a trip to do in Rio tomorrow :)

apaszke · 2017-01-06T23:55:34Z

Ugh nvm, for some reason I haven't noticed that it's WIP...

Have a nice trip! 😃

bhack · 2017-03-23T23:37:08Z

Have you tested https://github.com/longcw/faster_rcnn_pytorch?
/cc @longcw

fmassa · 2017-03-23T23:39:47Z

@bhack I haven't, and I didn't have the time to finish this properly.
Given that there are already a number of pytorch implementations of object detection algorithms in pytorch, I'll close this one for the time being.
If I find some time to finish this up with a simple interface, I'll send a new PR.

bhack · 2017-03-25T09:53:33Z

@KaimingHe In the plan of releasing Mask r-cnn there will be also a faster-rcnn pytorch implementation merged in this repository?

bhack · 2017-04-01T13:04:46Z

A TF WIP Mask R-CNN effort was started at https://github.com/CharlesShang/FastMaskRCNN. Actually there is still no public reference implementation of the paper so we will see what kind of accurancy can be reproduced.

bhack · 2017-04-01T13:14:24Z

/cc @soumith for https://discuss.pytorch.org/t/deep-sharp-mask-or-mask-r-cnn/1469/4

fmassa added 18 commits December 19, 2016 09:49

Changes from yesterday

10caef7

Seems to work

55b2bb0

Change generator

faa3b4e

fast rcnn

f2e9248

No bbox regression

Starting to prototype faster rcnn

3b3f1ae

rpn runs

c653216

Need to test for correctness

frcnn runs

3bee8e6

updating

22e7696

A bit of organization

5e71e6c

Organization

9e65a2f

Rename

e119672

rename

4058094

Cleaning up a bit

a0061e8

Reduce default learning rate

cfb643f

Fixes

e36a936

Removing unnecessary files from tree

e73ee53

Rename

79c2402

minor changes

d8d378c

apaszke reviewed Jan 6, 2017

View reviewed changes

fmassa closed this Mar 23, 2017

runzeer mentioned this pull request Apr 14, 2019

the error when I run the example for the imagenet #544

Closed

		class_to_ind = dict(zip(cls, range(len(cls))))


		train = VOCDetection('/home/francisco/work/datasets/VOCdevkit/', 'train',

Faster R-CNN (WIP) #21

Faster R-CNN (WIP) #21

Uh oh!

Conversation

fmassa commented Dec 29, 2016

Uh oh!

colesbury commented Jan 6, 2017

Uh oh!

apaszke left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

apaszke commented Jan 6, 2017

Uh oh!

fmassa commented Jan 6, 2017

Uh oh!

apaszke commented Jan 6, 2017

Uh oh!

bhack commented Mar 23, 2017

Uh oh!

fmassa commented Mar 23, 2017

Uh oh!

bhack commented Mar 25, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bhack commented Apr 1, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bhack commented Apr 1, 2017

Uh oh!

Uh oh!

bhack commented Mar 25, 2017 •

edited

Loading

bhack commented Apr 1, 2017 •

edited

Loading