Recovering from `rke etcd snapshot-restore` failures

Open VltraHeaven opened this issue 2 years ago • 0 comments

If an error occurs while restoring etcd from a snapshot, failures can persist on later attempts even after the root cause has been fixed. To avoid this, clean up any or all of the containers that were created during the `rke etcd snapshot-restore` process before attempting another restore. The following containers are created during the etcd restore process:

  • etcd-restore
  • etcd-checksum-checker
  • etcd-download-backup
  • etcd-Serve-backup
  • etcd-extract-statefile
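
The cleanup described above could be scripted along these lines. This is only a minimal sketch, not an official RKE procedure: it assumes the names listed are Docker containers on the node (RKE defines them as container names in `services.go`), that the `docker` CLI is available, and that the script is run on each node involved in the restore. The helper names `removal_commands` and `cleanup` are hypothetical. It defaults to a dry run that only prints the commands.

```python
import subprocess

# Container names taken from the list above.
RESTORE_CONTAINERS = [
    "etcd-restore",
    "etcd-checksum-checker",
    "etcd-download-backup",
    "etcd-Serve-backup",
    "etcd-extract-statefile",
]


def removal_commands(names=RESTORE_CONTAINERS):
    """Build one `docker rm -f <name>` command per container name."""
    return [["docker", "rm", "-f", name] for name in names]


def cleanup(dry_run=True):
    """Print the removal commands, or execute them when dry_run is False."""
    for cmd in removal_commands():
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=False: docker exits non-zero when a container does not
            # exist, which is expected if that restore step never ran.
            subprocess.run(cmd, check=False)


if __name__ == "__main__":
    # Assumption: review the printed commands first, then rerun with
    # dry_run=False on the affected node.
    cleanup(dry_run=True)
```

Leftover containers that were never created simply produce a "No such container" message from Docker, so it should be safe to attempt removal of all five names before retrying the restore.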

References

  • https://github.com/rancher/rke/blob/397ffdbb81177c7b524695f40be48d5d8f999258/services/services.go#L34

VltraHeaven, Jan 28 '22 00:01