Matthew Booth
The documentation of this field is quite spectacularly bad: https://kubernetes.io/docs/concepts/services-networking/dns-pod-service/#pod-s-dns-policy > "ClusterFirstWithHostNet": For Pods running with hostNetwork, you should explicitly set its DNS policy to "ClusterFirstWithHostNet". Otherwise, Pods running with...
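For reference, the rule those docs describe can be sketched roughly as below. This is only an illustration of the documented fallback (a hostNetwork Pod left on "ClusterFirst" behaves as "Default"), not Kubernetes code; the function name `effective_dns_policy` and the `pod_spec` fragment are hypothetical.

```python
# Minimal sketch of the documented fallback rule; not taken from Kubernetes itself.
def effective_dns_policy(host_network: bool, dns_policy: str = "ClusterFirst") -> str:
    """Return the DNS behavior a Pod effectively gets, per the Kubernetes docs."""
    if host_network and dns_policy == "ClusterFirst":
        # Documented fallback: a hostNetwork Pod with ClusterFirst behaves as
        # Default, so it inherits the node's resolver and skips cluster DNS.
        return "Default"
    return dns_policy


# Hypothetical Pod spec fragment showing the explicit setting for hostNetwork Pods.
pod_spec = {
    "hostNetwork": True,
    "dnsPolicy": "ClusterFirstWithHostNet",
}

print(effective_dns_policy(True))
print(effective_dns_policy(pod_spec["hostNetwork"], pod_spec["dnsPolicy"]))
```

Under this reading, a hostNetwork Pod only resolves through cluster DNS when the policy is set explicitly, which matches the behavior discussed in this thread.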
> @mdbooth occm wasn't able to query the internal coredns instance without this new value > > It matters because of specific internal dns zone with squid proxy that should...
> our CI passed so it should be a smaller portion of error case I also wondered about that. Does that mean CNI comes up on an uninitialized node, and...
It's probably too late to ask for logs on this one, but I wonder what the sequence of events here is. Presumably we're going to add the new control plane...
/remove-lifecycle rotten
/remove-lifecycle stale
As discussed in the PR, I don't think https://github.com/ceph/ceph/pull/54392 fixes the reported crash. I've submitted https://github.com/ceph/ceph/pull/54400, which I believe should fix it.
Incidentally, I notice this was reported against a test cluster, but I hit this on real hardware: https://lists.ceph.io/hyperkitty/list/[email protected]/thread/74KIM6WHKT7UMCGEL27JQJSWMAGQW7U6/
Ok. I think you'll need my patch first, though, because the code removed in https://github.com/ceph/ceph/pull/54392 can't be reached without my patch.
@travisn I've put my understanding of this issue in https://github.com/ceph/ceph/pull/54392#issuecomment-1802160948 btw. If you wanted to move forward with a fix in rook we could ignore an error return from the...