Tim Gross
Tim Gross
While debugging a community issue (https://github.com/hashicorp/nomad/issues/14200) I noticed that if you `nomad job run` a job with a manual canary promotion, the job monitor will hang because it's awaiting manual...
In #15417 we added a new `Authenticate` method to the server that returns an `AuthenticatedIdentity` struct. This changeset implements this method for a small number of RPC endpoints that together...
Implement a metric for RPC requests with labels on the identity, so that administrators can monitor the source of requests within the cluster. This changeset demonstrates the change with the...
Once https://github.com/hashicorp/nomad/issues/13219 is complete, Nomad will run autopilot metrics on all servers. This could let us have health checks that specifically check that a server has completed restoring its initial...
Large clusters can have allocations and deployments in various failure states on a regular basis. Because evaluations are idempotent over the state, re-running an evaluation for service and system jobs...
In addition to lacking pagination (see https://github.com/hashicorp/nomad/issues/15479), the `operator debug` command takes the results of list queries and serializes them to a JSON array-of-objects. This format is not suitable for...
As noted in [Architecture: Eval Lifecycle](https://github.com/hashicorp/nomad/blob/main/contributing/architecture-eval-lifecycle.md#scheduling), the scheduler has 3 phases: reconciling, feasibility checking, and scoring. When a plan is submitted, it includes metrics for feasibility checking and scoring, but...
When new servers join the cluster, they stream a raft snapshot from the existing servers to catch up for replication. But many other operations are spinning up concurrently, including scheduling....
While working on some internal testing which resulted in discovering #15552, @anth0d pointed out we're missing a test as noted in this very old TODO: https://github.com/hashicorp/nomad/blob/v1.4.3/api/agent_test.go#L137. Opening this issue to...
Some consumers of `consul-template` use it like a library, where the application runs the runner in-process. For projects like Nomad which need to run with a high level of privilege,...