
Classic config peer discovery still might select two different seed nodes

Open dcorbacho opened this issue 1 year ago • 2 comments

Describe the bug

Sometimes in CI, the test case peer_discovery_classic_config_SUITE:successfully_discovery fails to cluster all 7 nodes. Six nodes join the same seed, while the seventh selects itself as the seed and never joins the rest of the cluster.

The peer discovery seed selection needs to be reviewed.

Reproduction steps

  1. It's a CI flake

Expected behavior

All 7 nodes join the same cluster.

Additional context

No response

dcorbacho avatar Nov 07 '24 10:11 dcorbacho

This was discussed relatively recently. Given that this backend has its nodes explicitly listed ahead of time, should we make the seed explicitly configurable, and use it in the tests?

We have tried different approaches over the years, such as "use the first node on a sorted list".

For example, Cassandra uses an explicit seed node configuration argument.
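
To illustrate why a rule like "use the first node on a sorted list" can still produce two seeds, here is a minimal Python sketch. It is not RabbitMQ's actual selection code, and the cause of this particular flake has not been established. The node names, the `select_seed` helper, and the assumption that one node only sees itself at selection time are all hypothetical.

```python
def select_seed(visible_nodes):
    """Hypothetical rule: pick the first node of the sorted list as the seed."""
    return sorted(visible_nodes)[0]


# Seven illustrative node names (not taken from the failing test).
all_nodes = [f"rabbit@node{i}" for i in range(1, 8)]

# Case 1: every node evaluates the rule against the same full list.
seeds_full = {node: select_seed(all_nodes) for node in all_nodes}

# Case 2 (assumption): node7 evaluates the rule while it can only see itself,
# for instance because the other nodes were not reachable yet.
views = {node: list(all_nodes) for node in all_nodes}
views["rabbit@node7"] = ["rabbit@node7"]
seeds_partial = {node: select_seed(view) for node, view in views.items()}

print(sorted(set(seeds_full.values())))
print(sorted(set(seeds_partial.values())))
```

In the first case all nodes agree on one seed. In the second, six nodes pick `rabbit@node1` and `rabbit@node7` picks itself, which matches the symptom described in the report. An explicitly configured seed would not depend on each node's view of the list.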

@dumbbell @mkuratczyk

michaelklishin avatar Nov 07 '24 17:11 michaelklishin

@dcorbacho: Do you have a link to a failing CI job or node logs?

dumbbell avatar Nov 11 '24 10:11 dumbbell