
DAOS-17492 control: Ensure updated members can become voters

Open kjacque opened this issue 9 months ago • 5 comments

When a new access point is added to the config and a restart follows, the member is updated rather than added, so it was not being considered as a voter in the MS leader election.
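The behavior described above can be illustrated with a minimal sketch. This is not the DAOS implementation: `Member`, `Membership`, `AddOrUpdate`, `SetAccessPoints`, and the host addresses are hypothetical names invented for illustration. It only shows the idea that the voter check has to run on the update path as well as the add path.

```go
package main

import "fmt"

// Member is a hypothetical, simplified stand-in for a system member record.
type Member struct {
	Addr  string
	State string
}

// Membership is a hypothetical store tracking members and election voters.
type Membership struct {
	accessPoints map[string]bool   // addresses configured as access points
	members      map[string]Member // known members keyed by address
	voters       map[string]bool   // addresses eligible to vote
}

// NewMembership builds a store with the given configured access points.
func NewMembership(accessPoints ...string) *Membership {
	m := &Membership{
		members: make(map[string]Member),
		voters:  make(map[string]bool),
	}
	m.SetAccessPoints(accessPoints...)
	return m
}

// SetAccessPoints replaces the configured list, simulating a config change
// that is picked up on restart.
func (m *Membership) SetAccessPoints(accessPoints ...string) {
	m.accessPoints = make(map[string]bool)
	for _, ap := range accessPoints {
		m.accessPoints[ap] = true
	}
}

// ensureVoter marks addr as a voter if it is a configured access point.
func (m *Membership) ensureVoter(addr string) {
	if m.accessPoints[addr] {
		m.voters[addr] = true
	}
}

// AddOrUpdate stores the member and reports whether it already existed.
// The voter check runs on both paths, so an updated member can still
// become a voter.
func (m *Membership) AddOrUpdate(mem Member) (updated bool) {
	_, updated = m.members[mem.Addr]
	m.members[mem.Addr] = mem
	m.ensureVoter(mem.Addr)
	return updated
}

// IsVoter reports whether addr is currently eligible to vote.
func (m *Membership) IsVoter(addr string) bool {
	return m.voters[addr]
}

func main() {
	m := NewMembership("host1:10001")
	m.AddOrUpdate(Member{Addr: "host1:10001", State: "joined"})
	m.AddOrUpdate(Member{Addr: "host2:10001", State: "joined"})
	fmt.Println("host2 voter before config change:", m.IsVoter("host2:10001"))

	// host2 is added to the access point list, then rejoins after a restart.
	m.SetAccessPoints("host1:10001", "host2:10001")
	updated := m.AddOrUpdate(Member{Addr: "host2:10001", State: "joined"})
	fmt.Println("updated existing member:", updated)
	fmt.Println("host2 voter after rejoin:", m.IsVoter("host2:10001"))
}
```

If `ensureVoter` were called only when the member did not already exist, the rejoin in `main` would take the update path and skip the voter check, which matches the symptom described here.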

Features: control

Steps for the author:

  • [x] Commit message follows the guidelines.
  • [x] Appropriate Features or Test-tag pragmas were used.
  • [ ] Appropriate Functional Test Stages were run.
  • [ ] At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • [ ] Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • [ ] Gatekeeper requested (daos-gatekeeper added as a reviewer).

May 16 '25 00:05 kjacque

Ticket title is 'Aurora user/perf: Admin (dmg) command does not work from newly added access_point'
Status is 'In Review'
Labels: 'ALCF,alcf_cluster,alcf_track'
https://daosio.atlassian.net/browse/DAOS-17492

May 16 '25 00:05 github-actions[bot]

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16392/6/execution/node/1337/log

May 29 '25 01:05 daosbuild3

Test stage NLT on EL 8.8 completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16392/7/display/redirect

Jun 06 '25 14:06 daosbuild3

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16392/8/execution/node/1468/log

Jun 12 '25 15:06 daosbuild3

Test stage Functional Hardware Medium MD on SSD completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16392/8/testReport/

Jun 12 '25 15:06 daosbuild3

Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16392/9/execution/node/1545/log

Jul 10 '25 13:07 daosbuild3

The CI run was actually all green; the failure is a known infrastructure issue: https://daosio.atlassian.net/browse/SRE-3228

Jul 24 '25 14:07 kjacque