AKS
AKS copied to clipboard
Failed to setup network for sandbox Failed to create endpoint: TransparentEndpointClient Error : operation not supported
After node restart pods not starting and hangs in Unknows status kubelet drops the following messages:
Failed to create pod sandbox: rpc error: code = Unknown desc = failed to setup network for sandbox "0bc75e4c9f785c657c13bf1ad27a6e8f652ec01c8b48042acf2a27067edfa2e7": plugin type="azure-vnet" failed (add): Failed to create endpoint: TransparentEndpointClient Error : operation not supported
Here is how the pods looks like (all daemonsets):
Pods which I tried to recreate are stuck in ContainerCreating status.
Noticed that problematic node has newer kernel version:
Restarting node, kubelet service or azure-cns pods does not helps.
Any thoughts?
Seems issue is related to the following https://github.com/Azure/azure-container-networking/issues/2156
Same here! Some deploys have worked, but some others don't
Updating: After restart cluster node seems to fix and now works fine
Same issue also exists in the kernel version 6.2.0-1015-azure.
For me the error was - [(combined from similar events): Failed to create pod sandbox: rpc error: code = Unknown desc = failed to setup network for sandbox]
I was running AKS - 1.26.3 and Azure CNI.
After restarting the cluster from portal, this issue got resolved.
Same issue with AKS 1.26.10 and Azure CNI
For me node re-image solved the issue https://learn.microsoft.com/en-us/cli/azure/vmss?view=azure-cli-latest#az-vmss-reimage
I have encountered the same issue. Pod scheduled on a particular node (which had a kernel version 6.x) kept crashing and the whole cluster was wonky.
The solution to the problem was to upgrade the VMSS to the latest scale set model (stackoverflow). After upgrading to latest, the kernel version changed to 5.15.0-1057-azure and all works OK now.
CNI with fix has been released several months back. You may need to upgrade your VMSS to get new CNI for clusters using nodesubnet. Issue tracked in CNI side - https://github.com/Azure/azure-container-networking/issues/2156
Action required from @aritraghosh, @julia-yin, @AllenWen-at-Azure
Issue needing attention of @Azure/aks-leads
Issue needing attention of @Azure/aks-leads
Issue needing attention of @Azure/aks-leads
Issue needing attention of @Azure/aks-leads
Issue needing attention of @Azure/aks-leads
Issue needing attention of @Azure/aks-leads
Closing the issue per comment from @tamilmani1989