Child jobs are invisible in Nomad UI once parent jobs are stopped/GC'd
Nomad version
1.8.10 LTS
Operating system and Environment details
N/A
Issue
Running/pending child jobs are completely invisible in the Nomad UI if the parent job has been stopped and garbage collected.
As described in https://github.com/hashicorp/nomad/issues/24669, when a parent job is stopped any previously dispatched parameterized child jobs, or pending periodic jobs are not also stopped and are left behind. Once the parent job has been garbage collected, these no longer show up in the UI at all (likely because the main jobs list only shows parent jobs).
I've discovered this in two ways recently:
-
nomad setup vault -checkreports jobs needing migration to workload identities which are invisible in the UI - Attempts to delete a namespace failed while no jobs were visible in the UI.
Not being able to see child jobs left behind when parent job is stopped makes it hard to cleanup left behind jobs, and leaves nasty surprises which disrupt future maintenance tasks.
Reproduction steps
- Create a parameterized job
- Dispatch an instance of the parameterized job
- Stop and purge the parent job
- Parameterized job now cannot be accessed by clicking around the UI starting at the jobs list view
Expected Result
Nomad UI should allow to list/view all running jobs. Child jobs without a parent job might need a placeholder entry in the main jobs list.
Actual Result
No entry in the main jobs list for either the parent job (since GC'd), or the child job.
Hi @optiz0r! That's a real pain for sure. I don't have a good workaround for you. I've marked this for roadmapping. (Internal ref https://hashicorp.atlassian.net/browse/NMD-1021)