pgo storage fills up too quickly / different psql operator?

Open durandom opened this issue 2 years ago • 4 comments

because backups are eating up our storage: https://github.com/CrunchyData/postgres-operator/issues/2531

Consider migration to https://cloudnative-pg.io/

https://blog.palark.com/cloudnativepg-and-other-kubernetes-operators-for-postgresql/

durandom avatar May 09 '23 14:05 durandom

Increasing the backup frequency might help, as in https://github.com/CrunchyData/postgres-operator/issues/2531#issuecomment-922349084

Alternatively, we could set archive_mode to off, as in https://github.com/CrunchyData/postgres-operator/issues/2531#issuecomment-1022070211

durandom avatar May 09 '23 15:05 durandom

/priority backlog

goern avatar May 25 '23 17:05 goern

Extend the PVC for psql by 2 GB and wait until the DB recovers. Then trigger a one-off backup, either by:

oc annotate postgrescluster db --overwrite postgres-operator.crunchydata.com/pgbackrest-backup="$(date)"

or oc create job --from=cronjob/db-repo1-full db-backup-full
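
For the PVC extension step, a minimal sketch, assuming the volume size is managed through the PostgresCluster resource named db (the name used in the commands above). The instance set index 0 and the target size 12Gi are hypothetical values, and the helper names are made up for illustration. Expansion also only works if the storage class allows volume expansion.

```shell
#!/bin/sh
# Sketch only: grow the data volume of the PostgresCluster "db".
# Assumptions (not from this thread): instance set index and target size.

# Print a JSON Patch that sets the requested data volume size
# for one instance set of the PostgresCluster spec.
build_resize_patch() {
  instance_index="$1"   # position in spec.instances (assumed: 0)
  new_size="$2"         # target size, e.g. current size plus 2Gi
  printf '[{"op":"replace","path":"/spec/instances/%s/dataVolumeClaimSpec/resources/requests/storage","value":"%s"}]\n' \
    "$instance_index" "$new_size"
}

# Apply the patch to the cluster; nothing is sent until this is called.
resize_db_volume() {
  oc patch postgrescluster db --type=json -p "$(build_resize_patch "$1" "$2")"
}
```

Calling `resize_db_volume 0 12Gi` would send the patch; running `build_resize_patch 0 12Gi` alone only prints it, which is useful for checking the path before touching the cluster.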

Long term we need either

  • monitoring of the psql PVC (though the volume fills up fast)
  • a setup that can tolerate failing backups
  • less WAL for psql
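
As a stopgap for the monitoring point, a rough sketch of a manual usage check. It assumes the output of `df -P` for the data volume is piped in; the mount path /pgdata, the pod name placeholder, and the 80 percent threshold in the usage note are assumptions, and the function names are hypothetical.

```shell
#!/bin/sh
# Sketch only: read `df -P` output on stdin and report volume usage.

# Print the usage percentage (without the % sign) from the first data row.
usage_percent() {
  awk 'NR==2 { sub(/%/, "", $5); print $5 }'
}

# Succeed when usage (first argument) is at or above the threshold (second).
is_above_threshold() {
  [ "$1" -ge "$2" ]
}
```

A check could then look like `pct="$(oc exec <db-pod> -- df -P /pgdata | usage_percent)"` followed by `is_above_threshold "$pct" 80 && echo "psql PVC at ${pct}%"`, where `<db-pod>` stands for the actual database pod.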

durandom avatar Sep 07 '23 12:09 durandom

Implemented some alerting, see https://github.com/b4mad/racing/blob/main/manifests/env/phobos/postgresql/alerting.yaml

Alertmanager will send alerts to https://discord.com/channels/850841654327640074/1106466539273732136

goern avatar Sep 13 '23 14:09 goern