aztk icon indicating copy to clipboard operation
aztk copied to clipboard

Handle startup task failures

Open jaley opened this issue 7 years ago • 1 comments

I currently have issues with entire clusters failing to startup due to apt-get problems, presumably setting up the host:

E: Unable to locate package linux-image-extra-4.15.0-1013-azure
E: Couldn't find any package by glob 'linux-image-extra-4.15.0-1013-azure'
E: Couldn't find any package by regex 'linux-image-extra-4.15.0-1013-azure'

This is happening on aztk 0.7.0, and I'm hoping that updating to 0.8.0 will resolve the problem, but internal build problems on our side have blocked me from testing that so far.

Regardless, at the Aztk CLI, the behaviour is just to wait indefinitely when every node fails to start up like this. This makes it hard for us to detect when we have a lot of automated jobs running regularly.

jaley avatar Jun 15 '18 12:06 jaley

That package is no longer downloaded in 0.8.0, so the issue is likely going to be resolved if you update. Still, this should not be happening, so I will look into it.

jafreck avatar Jun 15 '18 17:06 jafreck