C++: Fix performance issue on cpp/comma-before-misleading-indentation #10958

geoffw0 · 2022-10-24T18:31:36Z

C++: Fix rare performance issue on cpp/comma-before-misleading-indentation. The issue was _Call#39248e3c::Call::getAnArgument#0#dispred#ff__Expr#ef463c5d::Expr::getFullyConverted#0#dispred#f__#higher_order_body exploding (found on nba-emu_NanoBoyAdvance), which I interpret to mean that normalizeExpr was just getting too big on deep Expr trees (probably with lots of co-located hidden Exprs).

The new solution calculates the lowest relevant column number first (min(getCandidateColumn(e))), which should be less affected by co-located Exprs. Then in the main calculation that follows a lot of Exprs can be culled early using this relation.

Its unfortunately more code than before but I've tried to explain in the comments.

I'll run DCA on this, but here are some local stats:

BEFORE                                               RESULTS  TIME
nba-emu_NanoBoyAdvance (clean cache)                 TIMEOUT
MongoDB-2.2.3 (clean cache)                          0        13s
abseil-cpp (cache warmed up by `cpp/dead-code-goto`) 0        31s
torvalds_linux (cache warmed up ")                   0        49s

AFTER                                                RESULTS  TIME
nba-emu_NanoBoyAdvance (clean cache)                 0        33s
MongoDB-2.2.3 (clean cache)                          0        15s
abseil-cpp (cache warmed up by `cpp/dead-code-goto`) 0        38s
torvalds_linux (cache warmed up ")                   0        54s

@MathiasVP

…ation.

MathiasVP

A couple of questions. To me, this looks like a join order issue in the original code:

Tuple counts for _Call#39248e3c::Call::getAnArgument#0#dispred#ff_1#join_rhs__Expr#ef463c5d::Expr::getFullyConverted#__#antijoin_rhs/2@f9e794sd after 5m9s:
  2476000    ~4%     {2} r1 = JOIN _Expr#ef463c5d::Expr::getFullyConverted#0#dispred#ff_m#CommaBeforeMisleadingIndentation#36b1fbef::no__#shared WITH boundedFastTC:Expr#ef463c5d::Expr::getParentWithConversions#0#dispred#ff_10#higher_order_body:_Expr#ef463c5d::Expr::getFullyConverted#0#dispred#ff_m#CommaBeforeMisleadingIndentation#36b1fbef::no__#higher_order_body ON FIRST 1 OUTPUT Rhs.1 'arg1', Lhs.1 'arg0'
  
  2482000    ~4%     {2} r2 = _Expr#ef463c5d::Expr::getFullyConverted#0#dispred#ff_m#CommaBeforeMisleadingIndentation#36b1fbef::no__#shared UNION r1
  
  354000     ~5%     {2} r3 = JOIN r2 WITH Call#39248e3c::Call::getAnArgument#0#dispred#ff_1#join_rhs ON FIRST 1 OUTPUT Lhs.1 'arg0', Lhs.0 'arg1'
  
  9424361000 ~1%     {3} r4 = JOIN r2 WITH boundedFastTC:Expr#ef463c5d::Expr::getParentWithConversions#0#dispred#ff:__Expr#ef463c5d::Expr::getFullyConverted#0#dispred#ff_m#CommaBeforeMisleadingIndentation#36b1fbef::n__#higher_order_body ON FIRST 1 OUTPUT Rhs.1, Lhs.1 'arg0', Lhs.0 'arg1'
  0          ~0%     {2} r5 = JOIN r4 WITH Call#39248e3c::Call::getAnArgument#0#dispred#ff_1#join_rhs ON FIRST 1 OUTPUT Lhs.1 'arg0', Lhs.2 'arg1'
  
  354000     ~5%     {2} r6 = r3 UNION r5
                      return r6

I was able to hand-hold the optimizer to get rid of this join order here. It's still not as fast as your version, though.

cpp/ql/src/Best Practices/Likely Errors/CommaBeforeMisleadingIndentation.ql

MathiasVP · 2022-10-25T07:48:37Z

cpp/ql/src/Best Practices/Likely Errors/CommaBeforeMisleadingIndentation.ql

+  or
+  not e.getLocation().getStartColumn() = min(getCandidateColumn(e)) and
+  result = normalizeExpr(childWithConversions(e)) and
+  result.getLocation().getStartColumn() = min(getCandidateColumn(e))


Is this final conjunct necessary to prevent the normalizeExpr call on line 47 from having multiple results? If so, could we get rid of this final conjunct by wrapping line 47 in a min that orders by the start location? (Or does this run into non-monotonicity issues?

I believe line 48 is necessary to ensure we get a global minimum Expr, not just the minimum under one particular subtree (child of e). It doesn't appear to affect any of the tests if I remove it, but I think it would be wrong to do so.

Wrapping line 47 in min does indeed hit non-monotonicity issues (which seem to plague this particular predicate).

cpp/ql/src/Best Practices/Likely Errors/CommaBeforeMisleadingIndentation.ql

geoffw0 · 2022-10-25T13:25:37Z

I was able to hand-hold the optimizer to get rid of this join order here. It's still not as fast as your version, though.

I wonder if the restrictions you use here (ChildOfCommaOperand etc) would benefit my version as well?

geoffw0 · 2022-10-25T13:42:37Z

I wonder if the restrictions you use here (ChildOfCommaOperand etc) would benefit my version as well?

I haven't been able to find an effective way to combine them. :(

MathiasVP

LGTM!

C++: Fix rare performance issue on cpp/comma-before-misleading-indent…

6f77e14

…ation.

geoffw0 added the C++ label Oct 24, 2022

geoffw0 requested a review from a team as a code owner October 24, 2022 18:31

MathiasVP added the no-change-note-required This PR does not need a change note label Oct 25, 2022

MathiasVP reviewed Oct 25, 2022

View reviewed changes

C++: Rename predicate.

257748d

MathiasVP approved these changes Oct 26, 2022

View reviewed changes

MathiasVP merged commit 58b6c45 into github:main Oct 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

C++: Fix performance issue on cpp/comma-before-misleading-indentation #10958

C++: Fix performance issue on cpp/comma-before-misleading-indentation #10958

Uh oh!

geoffw0 commented Oct 24, 2022

Uh oh!

MathiasVP left a comment

Uh oh!

Uh oh!

MathiasVP Oct 25, 2022

Uh oh!

geoffw0 Oct 25, 2022

Uh oh!

Uh oh!

geoffw0 commented Oct 25, 2022

Uh oh!

geoffw0 commented Oct 25, 2022

Uh oh!

MathiasVP left a comment

Uh oh!

Uh oh!

C++: Fix performance issue on cpp/comma-before-misleading-indentation #10958

C++: Fix performance issue on cpp/comma-before-misleading-indentation #10958

Uh oh!

Conversation

geoffw0 commented Oct 24, 2022

Uh oh!

MathiasVP left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MathiasVP Oct 25, 2022

Choose a reason for hiding this comment

Uh oh!

geoffw0 Oct 25, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

geoffw0 commented Oct 25, 2022

Uh oh!

geoffw0 commented Oct 25, 2022

Uh oh!

MathiasVP left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!