Fix flakiness on StochasticDepth test #4758
Conversation
💊 CI failures summary and remediations

As of commit 59c1c8b (more details on the Dr. CI page):

💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI. Please report bugs/suggestions to the (internal) Dr. CI Users group.
Adding some highlights on the changes:
@pytest.mark.parametrize("p", [0.2, 0.5, 0.8]) | ||
@pytest.mark.parametrize("mode", ["batch", "row"]) | ||
def test_stochastic_depth(self, mode, p): | ||
def test_stochastic_depth_random(self, seed, mode, p): | ||
torch.manual_seed(seed) |
We keep the original test because it lets us check that the different mode values operate as expected. Using p values in the open interval (0, 1) is critical, because for p=0 and p=1 the two modes behave identically. The mitigation for the flakiness here is setting the seed.
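A minimal sketch of what the seeded randomized test could look like (the input shape and the final assertion are assumptions for illustration; only the parametrization, the rename, and the torch.manual_seed call come from the diff above):

```python
import pytest
import torch
from torchvision.ops import stochastic_depth


@pytest.mark.parametrize("seed", range(10))
@pytest.mark.parametrize("p", [0.2, 0.5, 0.8])
@pytest.mark.parametrize("mode", ["batch", "row"])
def test_stochastic_depth_random(seed, mode, p):
    # Fixing the seed makes the Bernoulli noise reproducible, so the same
    # parametrizations are exercised on every CI run.
    torch.manual_seed(seed)
    x = torch.ones(4, 3, 8, 8)  # hypothetical input shape
    out = stochastic_depth(x, p=p, mode=mode, training=True)
    # "batch" drops/keeps the whole tensor; "row" decides per sample (dim 0).
    assert out.shape == x.shape
```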
On my laptop the test passed for 97 out of 100 seeds, so there's a very small chance it will fail in the future. (First failure was on the 11th seed, lol.)
I agree it's fine to leave as-is for now
That's because I increased the p-value threshold to 1%. You'd expect false positives at roughly that rate now; hopefully we won't see them thanks to the fixed seeds.
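A quick back-of-the-envelope check of that intuition (my own arithmetic, not from the PR), assuming each seed acts as an independent trial at a 1% significance level:

```python
# Probability that at least one of k independent seeds trips a 0.01 threshold.
for k in (10, 100):
    print(k, 1 - 0.99 ** k)  # ~0.10 for 10 seeds, ~0.63 for 100 seeds
```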
@@ -1173,7 +1175,22 @@ def test_stochastic_depth(self, mode, p):
     num_samples += batch_size

     p_value = stats.binom_test(counts, num_samples, p=p)
-    assert p_value > 0.0001
+    assert p_value > 0.01
Significantly increased the threshold since we now only check 10 fixed seeds. We can reduce it if flakiness continues.
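For context, a sketch of the statistical check being relaxed here (the counting loop is abbreviated and the totals are hypothetical; only the stats.binom_test call and the new 0.01 threshold come from the diff):

```python
from scipy import stats

# counts / num_samples would be accumulated over repeated forward passes,
# counting how many rows (or batches) stochastic_depth zeroed out.
counts, num_samples = 210, 1000  # hypothetical totals for p = 0.2

p_value = stats.binom_test(counts, num_samples, p=0.2)
assert p_value > 0.01  # relaxed from 0.0001 now that only 10 fixed seeds run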
@pytest.mark.parametrize("seed", range(10)) | ||
@pytest.mark.parametrize("p", (0, 1)) | ||
@pytest.mark.parametrize("mode", ["batch", "row"]) | ||
def test_stochastic_depth(self, seed, mode, p): |
Adding an additional test with p=0 and p=1 to confirm it works as expected for the extreme values.
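Roughly, that deterministic check can look like the sketch below (the test name, input shape, and exact assertions are assumptions): at the extremes no statistics are needed, since p=0 must return the input unchanged and p=1 must zero it out.

```python
import pytest
import torch
from torchvision.ops import stochastic_depth


@pytest.mark.parametrize("seed", range(10))
@pytest.mark.parametrize("p", (0, 1))
@pytest.mark.parametrize("mode", ["batch", "row"])
def test_stochastic_depth_extremes(seed, mode, p):
    torch.manual_seed(seed)
    x = torch.rand(4, 3, 8, 8)  # hypothetical input shape
    out = stochastic_depth(x, p=p, mode=mode, training=True)
    if p == 0:
        assert torch.equal(out, x)  # nothing is dropped or rescaled
    else:
        assert torch.equal(out, torch.zeros_like(x))  # everything is dropped
```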
-noise = noise.bernoulli_(survival_rate).div_(survival_rate)
+noise = noise.bernoulli_(survival_rate)
+if survival_rate > 0.0:
+    noise.div_(survival_rate)
This was actually a bug; the previous code produced NaNs! Though it isn't something users will typically face: setting p=1 on the operator means you always drop the block (set it to 0). I'm not sure many users would want to do that, but it's worth fixing anyway.
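The essence of the fix, simplified (the helper name and the per-row noise shape are illustrative; the guarded division is what the diff above adds): with p=1 the survival rate is 0, so dividing the all-zero Bernoulli noise by 0 produced 0/0 = NaN.

```python
import torch


def _apply_stochastic_depth_noise(input: torch.Tensor, p: float) -> torch.Tensor:
    survival_rate = 1.0 - p
    size = [input.shape[0]] + [1] * (input.dim() - 1)  # per-row ("row" mode) noise
    noise = torch.empty(size, dtype=input.dtype, device=input.device)
    noise = noise.bernoulli_(survival_rate)
    if survival_rate > 0.0:
        # Rescale so the expected value matches the input; skipping this when
        # survival_rate == 0 avoids the 0/0 = NaN of the old code.
        noise.div_(survival_rate)
    return input * noise
```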
Thanks @datumbox!
Summary:
* Fix flakiness on the TestStochasticDepth test.
* Fix minor bug when p=1.0.
* Remove device and dtype setting.

Reviewed By: datumbox

Differential Revision: D32064694

fbshipit-source-id: 4107800cb6f8e56bcd85db31176afae394b86a21
Partially resolves #4506
cc @pmeier