
DAOS-17738 client: reset DTX base UUID after fork - b26

Open Nasf-Fan opened this issue 8 months ago • 2 comments

Reset the DTX base UUID after fork so that the parent and child processes do not generate the same DTX ID.

It also changes the vos_dtx logic to avoid an assertion failure when a client reuses a DTX ID.
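To illustrate the client-side idea, here is a minimal sketch in C, not the code from this PR. It assumes a DTX ID is built from a per-process random base plus a sequence number, and that the base can be regenerated in a `pthread_atfork()` child handler. All `demo_*` names, the ID layout, and the use of `/dev/urandom` are hypothetical and chosen only for the example.

```c
#include <assert.h>
#include <fcntl.h>
#include <pthread.h>
#include <stdint.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical stand-in for a DTX ID: per-process base plus a counter. */
struct demo_dtx_id {
	uint8_t  base[16];	/* UUID-sized random base */
	uint64_t seq;		/* sequence number within this process */
};

static uint8_t  demo_base[16];
static uint64_t demo_seq;
static int      demo_registered;

/* Regenerate the base from /dev/urandom and restart the sequence. */
static int demo_base_reset(void)
{
	uint8_t buf[sizeof(demo_base)];
	size_t  off = 0;
	int     fd = open("/dev/urandom", O_RDONLY);

	if (fd < 0)
		return -1;
	while (off < sizeof(buf)) {
		ssize_t n = read(fd, buf + off, sizeof(buf) - off);

		if (n <= 0) {
			close(fd);
			return -1;
		}
		off += (size_t)n;
	}
	close(fd);
	memcpy(demo_base, buf, sizeof(buf));
	demo_seq = 0;
	return 0;
}

/* Runs in the child right after fork(): drop the inherited base. */
static void demo_atfork_child(void)
{
	(void)demo_base_reset();
}

/* Pick an initial base and register the fork handler once. */
static int demo_init(void)
{
	if (demo_base_reset() != 0)
		return -1;
	if (!demo_registered) {
		if (pthread_atfork(NULL, NULL, demo_atfork_child) != 0)
			return -1;
		demo_registered = 1;
	}
	return 0;
}

/* Produce the next ID for the calling process. */
static void demo_next_id(struct demo_dtx_id *id)
{
	assert(id != NULL);
	memcpy(id->base, demo_base, sizeof(id->base));
	id->seq = demo_seq++;
}

/*
 * Fork once and compare bases. Returns 1 if the child's base differs from
 * the parent's, 0 if they are the same, -1 on error.
 */
static int demo_fork_check(void)
{
	struct demo_dtx_id parent_id, child_id;
	int   status;
	pid_t pid;

	if (demo_init() != 0)
		return -1;
	demo_next_id(&parent_id);

	pid = fork();
	if (pid < 0)
		return -1;
	if (pid == 0) {
		demo_next_id(&child_id);
		_exit(memcmp(child_id.base, parent_id.base,
			     sizeof(child_id.base)) != 0 ? 0 : 1);
	}
	if (waitpid(pid, &status, 0) != pid)
		return -1;
	return (WIFEXITED(status) && WEXITSTATUS(status) == 0) ? 1 : 0;
}
```

Without the child handler, the child would inherit both `demo_base` and `demo_seq`, so the next ID generated in each process would be identical. Calling `demo_fork_check()` exercises the handler with a real `fork()`; the server-side vos_dtx change described above is not modeled here.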

Steps for the author:

  • [ ] Commit message follows the guidelines.
  • [ ] Appropriate Features or Test-tag pragmas were used.
  • [ ] Appropriate Functional Test Stages were run.
  • [ ] At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • [ ] Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • [ ] Gatekeeper requested (daos-gatekeeper added as a reviewer).

Nasf-Fan, Jun 25 '25 04:06

Ticket title is 'daos rebuild cluster has some asserted engines with dtx_cmt_ent_update() Assertion 'dce_new->dce_reindex''
Status is 'In Review'
Labels: 'ALCF,alcf_track,hpe_cluster'
Job should run at elevated priority (1)
https://daosio.atlassian.net/browse/DAOS-17738

github-actions[bot], Jun 25 '25 04:06

Test stage Functional Hardware Large completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16540/2/execution/node/1360/log

daosbuild3, Jun 25 '25 23:06

Test stage Functional Hardware Medium completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16540/4/execution/node/1472/log

daosbuild3, Jul 03 '25 03:07

Test stage Test RPMs on EL 8.6 completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16540/13/display/redirect

daosbuild3, Jul 21 '25 01:07

Test stage Test RPMs on EL 8.6 completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16540/14/display/redirect

daosbuild3, Jul 21 '25 07:07

Test stage Test RPMs on EL 8.6 completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16540/15/display/redirect

daosbuild3, Jul 21 '25 11:07