daos icon indicating copy to clipboard operation
daos copied to clipboard

DAOS-17532 test: Remove local host client use from functional tests

Open shimizukko opened this issue 9 months ago • 16 comments

Add "test_clients: 1" to the test yaml for the tests that use client such as creating a container.

Some tests are already using up the available nodes, so update the test yaml to use dual engines per node to make room for the client node.

container/per_server_fault_domain needs to move to Hardware Large.

Skip-unit-tests: true Skip-fault-injection-test: true Skip-func-hw-test-medium-md-on-ssd: false Test-tag: ContainerAPIBasicAttributeTest AutoOCSelectionTest BasicSnapshot ContainerDestroyTest BoundaryPoolContainerSpace ContainerLabelTest OpenContainerTest ContainerQueryAttributeTest QueryPropertiesTest Snapshot DaosObjectQuery DaosSnapshotTest DmgSystemCleanupTest DFuseBashdcacheTest DFuseFdTest DfuseBashCmd DfuseEnospace ILWhiteList RootContainerTest BasicTxTest CreateManyDkeys ObjFetchBadParam ObjectDataValidation ObjOpenBadParam PunchTest SameKeyDifferentValue ObjUpdateBadParam PoolEvictTest PoolPDAProperty Permission RbldBasic RbldWithIO ContainerCleanupTest ContainerListConsolidationTest DdbTest PoolMembershipTest CreateContainterACLTest DkeyAkeyEnumPunch

Steps for the author:

  • [ ] Commit message follows the guidelines.
  • [ ] Appropriate Features or Test-tag pragmas were used.
  • [ ] Appropriate Functional Test Stages were run.
  • [ ] At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • [ ] Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • [ ] Gatekeeper requested (daos-gatekeeper added as a reviewer).

shimizukko avatar May 28 '25 05:05 shimizukko

Ticket title is 'Remove local host client use from functional tests' Status is 'In Progress' https://daosio.atlassian.net/browse/DAOS-17532

github-actions[bot] avatar May 28 '25 05:05 github-actions[bot]

Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/1/testReport/

daosbuild3 avatar May 28 '25 21:05 daosbuild3

control/dmg_system_cleanup.py failed with

18:31:31 ERROR| Reproduced traceback from: /localhome/jenkins/venv/lib64/python3.6/site-packages/avocado/core/test.py:767
18:31:31 ERROR| Traceback (most recent call last):
18:31:31 ERROR|   File "/usr/lib/daos/TESTING/ftest/control/dmg_system_cleanup.py", line 84, in test_dmg_system_cleanup_one_host
18:31:31 ERROR|     self.fail("Wrote to container #{} when it should have failed".format(idx))
18:31:31 ERROR|   File "/usr/lib/daos/TESTING/ftest/util/apricot/apricot/test.py", line 1382, in fail
18:31:31 ERROR|     super().fail(message)
18:31:31 ERROR|   File "/localhome/jenkins/venv/lib64/python3.6/site-packages/avocado/core/test.py", line 953, in fail
18:31:31 ERROR|     raise exceptions.TestFail(message)
18:31:31 ERROR| avocado.core.exceptions.TestFail: Wrote to container #0 when it should have failed

This is weekly test and I want to check the weekly results, but it hasn't run yet.

shimizukko avatar May 29 '25 22:05 shimizukko

Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/2/testReport/

daosbuild3 avatar Jun 08 '25 01:06 daosbuild3

Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/7/testReport/

daosbuild3 avatar Jun 12 '25 05:06 daosbuild3

Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/9/testReport/

daosbuild3 avatar Jun 16 '25 06:06 daosbuild3

Test stage Functional Hardware Medium MD on SSD completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/10/testReport/

daosbuild3 avatar Jun 16 '25 15:06 daosbuild3

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16444/11/execution/node/848/log

daosbuild3 avatar Jun 17 '25 01:06 daosbuild3

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-16444/12/execution/node/848/log

daosbuild3 avatar Jun 17 '25 06:06 daosbuild3

Test stage Functional on EL 8.8 completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/14/testReport/

daosbuild3 avatar Jun 18 '25 19:06 daosbuild3

Test stage Functional Hardware Medium MD on SSD completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/15/testReport/

daosbuild3 avatar Jun 20 '25 07:06 daosbuild3

Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16444/16/display/redirect

daosbuild3 avatar Jun 22 '25 13:06 daosbuild3

Test stage Functional Hardware Large MD on SSD completed with status UNSTABLE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos//view/change-requests/job/PR-16444/17/testReport/

daosbuild3 avatar Jun 23 '25 01:06 daosbuild3

countainer/boundary.py failed with server count 4, but passed with 7.

shimizukko avatar Jun 23 '25 07:06 shimizukko

Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net/job/daos-stack/job/daos/job/PR-16444/19/display/redirect

daosbuild3 avatar Jun 23 '25 11:06 daosbuild3

Looks like the smallest server count that makes countainer/boundary.py pass is 5.

shimizukko avatar Jun 26 '25 01:06 shimizukko

@daltonbohning @dinghwah I missed some tags, so some tests didn't run. I also had to fix some failures. Now all tests were executed and passed.

shimizukko avatar Jul 04 '25 07:07 shimizukko

@daltonbohning Please use the description at the top of this page when you merge. The original commit message is outdated. This PR is to improve the tests and not to fix failing tests.

shimizukko avatar Jul 07 '25 20:07 shimizukko