Tiebreakers: move into the handshake #1931

NickCraver · 2021-12-14T17:17:51Z

Currently the way we handshake is to get everything configured, wait for a tracer to complete, and then issue the tiebreakers to all servers if they are in play. This complicates a few things with respect to timings, duplication, and write paths being a one-off for tie breakers, which I tripped on hard in #1912.

In this, we instead move the tie breaker fetch as part of AutoConfigure as a fire-and-forget-process-the-result-later setup with a dedicated processor. This all happens before the tracer fires moving us to the next connection phase (added comments) so we should be safe. It should reduce both complexity and overall connection times proportional to endpoint latency (since we wait for completion right now).

What needs adding here is tests with us disabling commands like INFO, GET, etc. and ensuring things still behave as we want. In the overall, the tie breaker is slightly less isolated but should be happening in the same order and with the same exception if any - no net result change is intended there with respect to how we do or don't error along the way. But we never want a connection to fail because of a tiebreaker and I think that warrants a few tests:

Disable INFO and see if we can connect
Disable GET and see if we can connect
Store some invalid TieBreaker and see if we can connect (e.g. make it a hash instead of a string)
...and maybe others?

Currently the way we handshake is to get everything configured, wait for a tracer to complete, and then issue the tiebreakers to all servers if they are in play. This complicates a few things with respect to timings, duplication, and write paths being a one-off for tie breakers, which I tripped on hard in #1912. In this, we instead move the tie breaker fetch as part of AutoConfigure as a fire-and-forget-process-the-result-later setup with a dedicated processor. This all happens before the tracer fires moving us to the next connection phase (added comments) so we should be safe. It should reduce both complexity and overall connection times proportional to endpoint latency (since we wait for completion right now). What needs adding here is tests with us disabling commands like INFO, GET, etc. and ensuring things still behave as we want. In the overall, the tie breaker is slightly less isolated but _should_ be happening in the same order and with the same exception if any - no net result change is intended there with respect to how we do or don't error along the way. But we never want a connection to fail _because of a tiebreaker_ and I think that warrants a few tests: - [ ] Disable `INFO` and see if we can connect - [ ] Disable `GET` and see if we can connect - [ ] Store some invalid TieBreaker and see if we can connect (e.g. make it a hash instead of a string) ...and maybe others?

mgravell

some comments, but: 👍

src/StackExchange.Redis/ResultProcessor.cs

mgravell · 2021-12-15T09:59:02Z

src/StackExchange.Redis/ConnectionMultiplexer.cs

-                    var status = tieBreakers[i].Status;
-                    switch (status)
+                    var server = servers[i];
+                    string serverResult = server.TieBreakerResult;


should we nuke this here? or not worth the additional race conditions that would introduce for overlapped/staggered connects; meh, probably not worth the complexity risk - just wanted to share a thought

hmmm, I like the thinking - seems like we'd want to change where it's stored so it's transient in a connection state object or some such and not stored on the server but as a lookup that has the lifetime of the connect handshake perhaps?

NickCraver · 2021-12-15T13:29:06Z

Follow-up: going to look at moving this into a "handshake state" object we pass around to AutoConfigure and into the result processor, but that's a bigger refactor so merging here then revisiting.

Annnnnnd this is why we add tests, we would have tried to issue the GET and never connected in previous code, bad Craver, bad!

NickCraver · 2021-12-28T17:21:56Z

@mgravell Tweaked code here a bit (to not issue tie breakers when GET is disabled) and added tests, ready for eyes when back in action!

…reaker

NickCraver added the ⚙️ area:connection label Dec 14, 2021

NickCraver requested review from mgravell and philon-msft December 14, 2021 17:17

NickCraver added 2 commits December 14, 2021 12:50

Add release notes

e830cb0

Merge branch 'main' into craver/handshake-tiebreaker

ed60b39

mgravell approved these changes Dec 15, 2021

View reviewed changes

PR fixes!

a79c89d

mgravell approved these changes Dec 15, 2021

View reviewed changes

NickCraver added 2 commits December 28, 2021 12:13

Tiebreaker: add tests

fa5200c

Annnnnnd this is why we add tests, we would have tried to issue the GET and never connected in previous code, bad Craver, bad!

Add incorrect tiebreaker type test

fb6a135

Merge branch 'main' into craver/handshake-tiebreaker

0ca6c47

NickCraver requested a review from mgravell January 2, 2022 20:29

NickCraver mentioned this pull request Jan 5, 2022

Required ACLs to self configure #1795

Closed

mgravell approved these changes Jan 5, 2022

View reviewed changes

NickCraver added 2 commits January 5, 2022 11:45

Merge branch 'main' into craver/handshake-tiebreaker

a0a88d4

Merge remote-tracking branch 'origin/main' into craver/handshake-tieb…

0778202

…reaker

NickCraver merged commit 35d3e9c into main Jan 5, 2022

NickCraver deleted the craver/handshake-tiebreaker branch January 5, 2022 17:00

NickCraver mentioned this pull request Jan 8, 2022

Logical bug related to the reordering issue - SendTracer PING in tiebreak happens before AUTH #1759

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tiebreakers: move into the handshake #1931

Tiebreakers: move into the handshake #1931

Uh oh!

NickCraver commented Dec 14, 2021 •

edited

Loading

Uh oh!

mgravell left a comment

Uh oh!

Uh oh!

Uh oh!

mgravell Dec 15, 2021

Uh oh!

NickCraver Dec 15, 2021

Uh oh!

NickCraver commented Dec 15, 2021

Uh oh!

NickCraver commented Dec 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Tiebreakers: move into the handshake #1931

Tiebreakers: move into the handshake #1931

Uh oh!

Conversation

NickCraver commented Dec 14, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mgravell left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mgravell Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

NickCraver Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

NickCraver commented Dec 15, 2021

Uh oh!

NickCraver commented Dec 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

NickCraver commented Dec 14, 2021 •

edited

Loading