DAOS-17736 rebuild: exit the whole rebuild when one obj rebuild failed
When one object rebuild failed, exit the whole rebuild to avoid pool destroy timeout. After rebuild done, change rank domain's status from DOWN to DOWNOUT.
Steps for the author:
- [ ] Commit message follows the guidelines.
- [ ] Appropriate Features or Test-tag pragmas were used.
- [ ] Appropriate Functional Test Stages were run.
- [ ] At least two positive code reviews including at least one code owner from each category referenced in the PR.
- [ ] Testing is complete. If necessary, forced-landing label added and a reason added in a comment.
After all prior steps are complete:
- [ ] Gatekeeper requested (daos-gatekeeper added as a reviewer).
Ticket title is 'erasurecode/online_rebuild_mdtest.py pool destroy timedout' Status is 'Awaiting backport' Labels: 'scrubbed_2.6.5' Job should run at elevated priority (1) https://daosio.atlassian.net/browse/DAOS-17736
Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16543/2/testReport/
Test stage Functional Hardware Large completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16543/4/execution/node/1531/log
Test stage Functional Hardware Medium Verbs Provider completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16543/5/execution/node/1392/log
Test stage Functional Hardware Large completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16543/5/execution/node/1528/log
Test stage Build RPM on Leap 15.5 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16543/8/execution/node/334/log
Test stage Build RPM on EL 8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16543/8/execution/node/360/log
@daos-stack/daos-gatekeeper clang format only one warning is
- return pool_update_map_internal(pool_uuid, MAP_FINISH_REBUILD, true, list, NULL, NULL,
-
NULL, NULL, reclaim_ver, NULL);
- return pool_update_map_internal(pool_uuid, MAP_FINISH_REBUILD, true, list, NULL, NULL, NULL,
I can change it in a following PR to avoid CI re-test.