Use frozen BN only if pre-trained backbone #5443
Conversation
💊 CI failures summary: as of commit d222c46, 1 failure not recognized by patterns (more details on the Dr. CI page). This comment was automatically generated by Dr. CI.
Thanks!
trainable_backbone_layers = _validate_trainable_layers(
    pretrained or pretrained_backbone, trainable_backbone_layers, 5, 3
)
is_trained = pretrained or pretrained_backbone
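The decision the snippet above feeds into can be sketched in isolation. This is a hypothetical helper (`choose_norm_layer` is not torchvision API) illustrating the heuristic this PR introduces: freeze batch norm only when some pre-trained weights are loaded.

```python
def choose_norm_layer(pretrained: bool, pretrained_backbone: bool) -> str:
    # Heuristic from this PR: use FrozenBatchNorm2d only when at least
    # some pre-trained weights are loaded; otherwise keep regular,
    # trainable BatchNorm2d so from-scratch training still works.
    is_trained = pretrained or pretrained_backbone
    return "FrozenBatchNorm2d" if is_trained else "BatchNorm2d"

print(choose_norm_layer(pretrained=True, pretrained_backbone=False))   # FrozenBatchNorm2d
print(choose_norm_layer(pretrained=False, pretrained_backbone=True))   # FrozenBatchNorm2d
print(choose_norm_layer(pretrained=False, pretrained_backbone=False))  # BatchNorm2d
```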
nit: if the model was trained for detection from scratch with large batch sizes, and we then fine-tune it afterwards (still with large batch sizes), we would in that case still be using FrozenBatchNorm.

This is an OK heuristic, but it hints that we might want to make this an explicit parameter of the constructor in the future.
Reviewed By: vmoens
Differential Revision: D34878996
fbshipit-source-id: 690b04fe0810cbd45ed582067b79f7e4254c054e
Currently, the majority of our Detection models replace the `BatchNorm2d` layers with `FrozenBatchNorm2d`. This is a reasonable mitigation that improves the stability of training for small batch sizes. Unfortunately, our current implementation freezes the BNs even when they are completely randomly initialized. Since `FrozenBatchNorm2d` freezes both the running stats and the affine parameters, its parameters get initialized and fixed to their default values (see vision/torchvision/ops/misc.py, lines 30 to 33 at 0c2373d).

Consequently, the BN layers are effectively completely disabled for those who try to train the models from scratch.

This PR fixes the issue by replacing the BNs with FrozenBNs only when at least some pre-trained weights are loaded.
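To see why a randomly initialized frozen BN effectively disables the layer, here is a minimal pure-Python sketch of the frozen-BN transform. The default buffer values (`weight=1`, `bias=0`, `running_mean=0`, `running_var=1`) match the initialization the description refers to; `eps=1e-5` is assumed for illustration.

```python
import math

def frozen_bn(x, weight=1.0, bias=0.0, running_mean=0.0, running_var=1.0, eps=1e-5):
    # A frozen BN is a fixed affine transform per channel: neither the
    # running statistics nor the affine parameters ever update during
    # training, so whatever they were initialized to stays forever.
    scale = weight / math.sqrt(running_var + eps)
    return x * scale + (bias - running_mean * scale)

# With the default initialization the transform is (almost) the identity,
# so a model trained from scratch never gets a functioning batch norm:
print(frozen_bn(2.0))  # ≈ 2.0
```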