azure-container-networking icon indicating copy to clipboard operation
azure-container-networking copied to clipboard

fix: CNS should not mark itself as ready if it can't write out a conflist.

Open paulgmiller opened this issue 1 month ago • 2 comments

Reason for Change: CNS should not signal its healthz till it can write out a conflist. Going ready

  1. makes it it harder for both engineers and systems to know CNS isn't getting to a good state.
  2. makes it easier for a bad rollout to takea bug to all nodes.

Refactors

  1. This moves alot of ncsync logic out of internalapi.go to synchostnc.go
  2. It adds a new wait group like object (networkContainerSyncState) to httprestservice
  3. Changes direct calls/loops in main to SyncHostNCVersion to use StartSyncHostNCVersionLoop for consistencey and better mangemetn of starting wait
  4. has nodesubent also startwat/signal readiness and not write the conflist direcl
  5. moves conf list writing to a wait function that also makes sure waits on nc (nodesubbnet and syncHostNCVersion) have signaled.

Tries to hide as much as possible to make movign to subpackage easier in future but pretty tangled with HttpRestServer still.

Notes:

paulgmiller avatar Nov 25 '25 01:11 paulgmiller

/azp run Azure Container Networking PR

rbtr avatar Dec 01 '25 17:12 rbtr

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines[bot] avatar Dec 01 '25 17:12 azure-pipelines[bot]

This pull request is stale because it has been open for 2 weeks with no activity. Remove stale label or comment or this will be closed in 7 days

github-actions[bot] avatar Dec 16 '25 00:12 github-actions[bot]

Pull request closed due to inactivity.

github-actions[bot] avatar Dec 24 '25 00:12 github-actions[bot]