Conversation

dumbbell (Collaborator)
Why

With Mnesia, when the network partition handling strategy is set to pause_minority, nodes on the "minority side" are stopped.
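For reference, this strategy is selected with the cluster_partition_handling key in rabbitmq.conf; a minimal sketch (the key and value are real, the rest of the file is elided):

```ini
# Stop the nodes on the minority side of a network partition.
cluster_partition_handling = pause_minority
```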

Thus, the exclusive queues that were hosted by nodes on that minority side are lost:

  • Consumers connected to these nodes are disconnected because the nodes are stopped.
  • On the majority side, the queue records are deleted from the metadata store.

This was fine with Mnesia, given how this network partition handling strategy is implemented. However, it does not work with Khepri, because the nodes on the "minority side" continue to run and serve clients. The cluster therefore ends up in an inconsistent state:

  1. The "majority side" deleted the queue records.
  2. When the network partition is resolved, the "minority side" receives the record deletions, but the queue processes continue to run.

How

With Khepri, we no longer delete transient queue records just because a node goes down. Thanks to this, an exclusive queue and its consumer are not affected by a network partition: they continue to work.
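As a rough sketch of that behavior change (illustrative only: handle_node_down/1 and delete_transient_queues_on_node/1 are hypothetical stand-ins, not the actual RabbitMQ code; rabbit_khepri:is_enabled/0 is the real feature check):

```erlang
%% Illustrative sketch, not the actual implementation: on a node-down
%% event, transient queue records are only swept when Mnesia is the
%% metadata store; with Khepri they are kept, so exclusive queues keep
%% working across a network partition.
handle_node_down(Node) ->
    case rabbit_khepri:is_enabled() of
        true  -> ok;                                   %% keep the records
        false -> delete_transient_queues_on_node(Node) %% hypothetical helper
    end.
```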

However, if a node is permanently lost, we still need to clean up its dead queue records. This was already done for durable queues with both Mnesia and Khepri. But with Khepri, transient queue records persist in the store just like durable queue records (unlike with Mnesia).

That's why this commit renames the clean-up function rabbit_amqqueue:forget_all_durable/1 to rabbit_amqqueue:forget_all/1, which deletes the records of all queues that were hosted on the given node, regardless of whether they are transient or durable.
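A minimal sketch of what that clean-up amounts to (hedged: delete_queue_record/1 is a hypothetical stand-in for the real deletion path, which is more involved; amqqueue:qnode/1 and rabbit_amqqueue:list/0 are existing accessors):

```erlang
%% Illustrative sketch of forget_all/1: delete the record of every
%% queue whose home node is the lost node, ignoring the durable flag.
forget_all(Node) ->
    lists:foreach(
      fun(Q) ->
              case amqqueue:qnode(Q) of
                  Node -> delete_queue_record(Q); %% stand-in helper
                  _    -> ok
              end
      end,
      rabbit_amqqueue:list()),
    ok.
```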

Fixes #12949, #12597.

dumbbell self-assigned this on Sep 19, 2025
dumbbell force-pushed the fix-exclusive-queues-with-khepri branch from dd66a56 to 2b31b23 on September 19, 2025 at 11:57
Successfully merging this pull request may close these issues:

  • Exclusive queues can be deleted without the consumers being notified
  • With Khepri, transient exclusive queues are not deleted in case of a partition