Add softmax_focal_loss() to allow multi-class focal loss #7676
Conversation
torchvision/ops/focal_loss.py
Outdated
alpha (float): Weighting factor in range (0,1) to balance
    positive vs negative examples or -1 for ignore. Default: ``0.25``.
I don't see a conditional checking whether `alpha` is -1 anywhere.
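For illustration only, such a guard could mirror the `alpha >= 0` check that `sigmoid_focal_loss` uses. In this sketch, `ce_loss`, `pt`, `gamma`, and `alpha` are assumed to be the variables defined in this PR's function; it is not the PR's code:

```python
# Illustrative sketch: a negative alpha disables the alpha weighting,
# mirroring the alpha >= 0 guard in sigmoid_focal_loss.
focal_loss = ((1 - pt) ** gamma) * ce_loss
if alpha >= 0:
    focal_loss = alpha * focal_loss
```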
torchvision/ops/focal_loss.py
Outdated
# need to compute the softmax manually anyway. We don't implement that
# here for brevity, but this code can be extended for such a use-case.
pt = torch.exp(-ce_loss)
focal_loss = alpha * ((1 - pt) ** gamma) * ce_loss
I have my doubts that the `alpha` term used here is correct. In fact we want `alpha_t`. Although the paper doesn't explicitly give the formula for `alpha_t`, it states in its definition: "For notational convenience, we define $\alpha_t$ analogously to how we defined $p_t$."

So given that $p_t = p$ if $y = 1$ and $p_t = 1 - p$ otherwise, we have $\alpha_t = \alpha$ if $y = 1$ and $\alpha_t = 1 - \alpha$ otherwise. Therefore I believe the loss should be $\mathrm{FL}(p_t) = -\alpha_t (1 - p_t)^{\gamma} \log(p_t)$, i.e. $\alpha_t$ rather than a bare $\alpha$ should multiply the focal term.

Torchvision's `sigmoid_focal_loss` does something like this in its implementation:
vision/torchvision/ops/focal_loss.py
Lines 43 to 45 in cc0f9d0
if alpha >= 0:
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    loss = alpha_t * loss
However, I've seen another implementation in the wild pass `alpha` through to the `weight` argument of `nll_loss`:
AdeelH/pytorch-multi-class-focal-loss/focal_loss.py
This strikes me as the correct approach since it allows one to weigh each class separately in a multi-class setting where there are no "negative" classes.
In other words, there's a subtle difference in intent here versus the sigmoid BCE approach (`sigmoid_focal_loss`), where every class is effectively split into separate positive / negative predictions.
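For concreteness, here is a minimal sketch of that per-class-weight variant. The function name, shapes, and defaults are illustrative assumptions, not the linked code or torchvision's API:

```python
import torch
import torch.nn.functional as F

def multiclass_focal_loss(logits, targets, alpha=None, gamma=2.0):
    # Sketch only. logits: (N, C); targets: (N,) class indices;
    # alpha: optional (C,) tensor of per-class weights; gamma: focusing parameter.
    log_p = F.log_softmax(logits, dim=-1)
    # Per-class weighting enters through nll_loss's `weight` argument.
    ce = F.nll_loss(log_p, targets, weight=alpha, reduction="none")
    # p_t is the probability of the true class, taken from the unweighted log-probs.
    pt = log_p.gather(-1, targets.unsqueeze(-1)).squeeze(-1).exp()
    # The focal term down-weights well-classified examples.
    return ((1 - pt) ** gamma * ce).mean()
```

The key difference from the $\alpha_t$ formulation above is that `alpha` here is a length-C vector of class weights rather than a scalar balancing positives against negatives.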
I agree with your assessment above.
I have put up a revised PR at #7760 since I switched local branches.
In image segmentation tasks, focal loss is useful when classifying each image pixel as one of N classes. Unfortunately, sigmoid_focal_loss() isn't useful in such cases. I found that others have been asking for this as well (see #3250), so I decided to submit a PR for it.
I'm opening this PR to check if this is something the pytorch-vision team is interested in merging.
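As a hedged illustration of that segmentation use case (the shapes and the gamma value are made up, and this is not the exact function signature proposed in this PR), a softmax-based focal loss can be applied per pixel like this:

```python
import torch
import torch.nn.functional as F

# N images, C classes, H x W pixels; labels are per-pixel class indices.
N, C, H, W = 2, 4, 8, 8
logits = torch.randn(N, C, H, W)
labels = torch.randint(0, C, (N, H, W))

ce_loss = F.cross_entropy(logits, labels, reduction="none")  # per-pixel CE, shape (N, H, W)
pt = torch.exp(-ce_loss)                   # probability assigned to the true class at each pixel
loss = ((1 - pt) ** 2.0 * ce_loss).mean()  # focal modulation with gamma = 2.0
```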