KEP 1645: add more conflict condition on asymetrical traffic #5706

MrFreezeex · 2025-11-25T17:55:10Z

One-line PR description: add more conflict condition on asymetrical traffic

Issue link: Multi-Cluster Services API #1645

Other comments:

Make ports raise a conflict when it's not a exact match and a note describing that implementation must not redirect traffic to endpoints from services that actually doesn't declare this port.

Also suggest doing the same for IPFamilies which might have asymmetrical issues. It's merely a suggestion as IPfamilies handling are implementation defined and some implementation may not have issues like that.

k8s-ci-robot · 2025-11-25T17:55:19Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: MrFreezeex
Once this PR has been reviewed and has the lgtm label, please assign jeremyot for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/sig-multicluster/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mikemorris · 2025-11-25T18:27:14Z

Suggested approach makes sense to me and feels less disruptive than changing the guidance from union to intersection.

/lgtm

zhiying-lin · 2025-11-28T02:33:51Z

LGTM, thank you Arthur!

lauralorenz · 2025-12-03T17:19:13Z

keps/sig-multicluster/1645-multi-cluster-services-api/README.md

 set of exported services don’t match, the clusterset service will expose the
-union of service ports declared on its constituent services.
+union of service ports declared on its constituent services and raise a `PortConflict`
+conflict condition. In that case, network traffic must be directed only to endpoints


In sentence for how IPFamilies should be handled above, its directed that the implementer "may" raise a conflict, while this one I'm commenting on here which is for ports says they "will". This line about ports is also more strict on what must be done for routing ("must be directed only") vs how it is described above for IPFamilies ("might result in network traffic reaching only a subset"). Is the difference in how these are treated on purpose? Based on what I saw from the notes from when we discussed in SIG-MC (ref) I think they should both mandate that the conflict raise should be required but how the implementation routes should be implementation defined.

Ah yes indeed, I used "may" for IPFamilies because the exact handling is all implementation defined but since there is a "when" in the sentence which may not apply to some implementations it seems fine to change the "may" by a "must" and some implementations won't need to care about that at all. We would most likely not be able to check that in the conformance tests though but that's a separate concerns from the KEP anyway!

keps/sig-multicluster/1645-multi-cluster-services-api/README.md

Signed-off-by: Arthur Outhenin-Chalandre <[email protected]>

lauralorenz · 2025-12-09T18:12:31Z

Talked in SIG-MC 12/9 about the change and how this PR is addressing two categories of thing (IPFamilies, Ports) that themselves have two things to address (whether to raise a conflict condition, and how prescriptive the KEP is about how predictable the routing is thereafter). The wording at the time seemed to me to have IPFamilies be strict on raising condition, but loose on routing, while in the Ports case it was strict on both. I talked about how I wanted the two categories (IPFamilies and Ports) the same in how they approach both of those since to me they seemed the same problem. As of now the wording was updated so that they treat each the same, so

/lgtm

That being said I want to add some background on what we talked about because I think it's relevant for any future changes.

We did talk a little bit about how a service with some of its endpoints having a new port (like old backends exposing port 80 but newer backends exposing port 80 and port 81) how it's a little different in how the consumer chooses which one to contact (as they may be materially different applications?) vs what we think a consumer would expect would be different from contacting a service with IPv4 vs IPv6 (though Arthur said it could be possible an intermediary gateway could still make those meaningfully different services in a predictable way if a user wanted to, lol). THEN I got stuck on how that situation was an invalid representation of the philosophical assumption in MCS that all backends are fungible as the same Service. THEN THEN Arthur brought up that the fact that there is a conflict at all especially at the Port level is already degrading that assumption in the first place. And in the end we got to a place where the routing is now not prescriptive for either of them, BUT I'm open to hardening that in the future especially as we see what implementations do.

tpantelis · 2025-12-09T23:02:39Z

keps/sig-multicluster/1645-multi-cluster-services-api/README.md

-union of service ports declared on its constituent services.
+union of service ports declared on its constituent services and raise a `PortConflict`
+conflict condition. In that case, network traffic should be directed only to endpoints
+from constituent services that actually expose the targeted port.


Can you clarify "targeted port", specifically in relation to the prior language that talks about "ports" (plural)? I assume by "targeted port" you mean a port that is in conflict, meaning one that is configured on one constituent service but not another. It sounds like you're saying such a port should still be exposed but only from the constituent cluster that has it. If so, is this now a strict requirement for implementations?

BTW, in Submariner, we only expose a port if it's configured for every constituent service.

Can you clarify "targeted port", specifically in relation to the prior language that talks about "ports" (plural)? I assume by "targeted port" you mean a port that is in conflict, meaning one that is configured on one constituent service but not another.

"targeted port" reference the network traffic (will try to clarify the sentence) but yes it match your explanation.

If so, is this now a strict requirement for implementations?

This PR started like that but the must is now a should so it's more a recommendation for implementation than a strict requirement in the current text then.

BTW, in Submariner, we only expose a port if it's configured for every constituent service.

Ah! But how does that works with the fact that the ServiceImport is exposing a union somehow?

In Cilium we do the union on the ServiceImport like MCS-API KEP is enforcing and pass it down directly to our derived Service and we keep the port name/number from the EndpointSlice/backend that we get from all the clusters. And IIUC what happens in kube-proxy and similarly in Cilium is that the port is matched by its name between the EndpointSlice/backends and the Service. So if you add a new port we will correctly only route traffic to the constituent clusters that actually have this port exposed, however if there is a conflict on the port name it might have some weird behavior (and it makes me think that I should probably at least document that edge case on our side 😅).

Ah! But how does that works with the fact that the ServiceImport is exposing a union somehow?

The ServiceImport union is just for conformance - we don't use it. The real action is with the EndpointSlices.

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Nov 25, 2025

k8s-ci-robot requested review from RainbowMango and ryanzhang-oss November 25, 2025 17:55

k8s-ci-robot added kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/multicluster Categorizes an issue or PR as relevant to SIG Multicluster. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Nov 25, 2025

MrFreezeex mentioned this pull request Nov 25, 2025

apis: conformance: add more conflict condition on asymetrical traffic kubernetes-sigs/mcs-api#132

Open

k8s-ci-robot assigned mikemorris Nov 25, 2025

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 25, 2025

lauralorenz reviewed Dec 3, 2025

View reviewed changes

MrFreezeex force-pushed the KEP1645-port-ipfamilies-more-conflict branch from 3442bf2 to 639a31b Compare December 3, 2025 18:17

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 3, 2025

MrFreezeex force-pushed the KEP1645-port-ipfamilies-more-conflict branch from 639a31b to 8f3993b Compare December 3, 2025 18:26

lauralorenz reviewed Dec 9, 2025

View reviewed changes

keps/sig-multicluster/1645-multi-cluster-services-api/README.md Outdated Show resolved Hide resolved

KEP 1645: add more conflict condition on asymetrical traffic

baf4e0c

Signed-off-by: Arthur Outhenin-Chalandre <[email protected]>

MrFreezeex force-pushed the KEP1645-port-ipfamilies-more-conflict branch from 8f3993b to baf4e0c Compare December 9, 2025 17:49

k8s-ci-robot assigned lauralorenz Dec 9, 2025

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 9, 2025

tpantelis reviewed Dec 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

KEP 1645: add more conflict condition on asymetrical traffic #5706

KEP 1645: add more conflict condition on asymetrical traffic #5706

MrFreezeex commented Nov 25, 2025 •

edited

Loading

Uh oh!

k8s-ci-robot commented Nov 25, 2025

Uh oh!

mikemorris commented Nov 25, 2025

Uh oh!

zhiying-lin commented Nov 28, 2025

Uh oh!

lauralorenz Dec 3, 2025

Uh oh!

MrFreezeex Dec 3, 2025

Uh oh!

Uh oh!

lauralorenz commented Dec 9, 2025

Uh oh!

tpantelis Dec 9, 2025 •

edited

Loading

Uh oh!

MrFreezeex Dec 10, 2025 •

edited

Loading

Uh oh!

tpantelis Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

KEP 1645: add more conflict condition on asymetrical traffic #5706

Are you sure you want to change the base?

KEP 1645: add more conflict condition on asymetrical traffic #5706

Conversation

MrFreezeex commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k8s-ci-robot commented Nov 25, 2025

Uh oh!

mikemorris commented Nov 25, 2025

Uh oh!

zhiying-lin commented Nov 28, 2025

Uh oh!

lauralorenz Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

MrFreezeex Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lauralorenz commented Dec 9, 2025

Uh oh!

tpantelis Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MrFreezeex Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tpantelis Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

MrFreezeex commented Nov 25, 2025 •

edited

Loading

tpantelis Dec 9, 2025 •

edited

Loading

MrFreezeex Dec 10, 2025 •

edited

Loading