infrastructure icon indicating copy to clipboard operation
infrastructure copied to clipboard

Orka mac x64 nodes don't always initialize successfully or take a very long time to...

Open andrew-m-leonard opened this issue 1 year ago • 1 comments

We have seen hung/timeout mac x64 Orka requests, eg: https://ci.adoptium.net/job/Test_openjdk22_hs_sanity.openjdk_x86-64_mac/30/console

14:21:55  Could not find any nodes with 'ci.role.test&&hw.arch.x86&&(sw.os.osx||sw.os.mac)&&!sw.os.osx.10_14' label
[Pipeline] timeout
14:21:55  Timeout set to expire in 5 min 0 sec
[Pipeline] {
[Pipeline] node
14:22:10  Still waiting to schedule task
14:22:10  All nodes of label ‘[ci.role.test&&hw.arch.x86&&(sw.os.osx||sw.os.mac)&&!sw.os.osx.10_14](https://ci.adoptium.net/label/ci.role.test&&hw.arch.x86&&(sw.os.osx%7C%7Csw.os.mac)&&!sw.os.osx.10_14/)’ are offline
14:26:55  Cancelling nested steps due to timeout

From watching the Jenkins node view, I can see the Orka Cloud initiates several times to attempt to start a VM, but each time the node fails to start the agent with message:

Waiting for SSH to be enabled

It eventually seems to give up, and try starting a new Orka VM.

@gdams I think we may need Orka support to take a look? again I think this is only an x64Mac problem.

andrew-m-leonard avatar Mar 21 '24 14:03 andrew-m-leonard

Ref #2536 which is still open until the machines are confirmed as fully functional ;-)

sxa avatar Mar 21 '24 17:03 sxa