percona-server-mongodb-operator Performance degradation in large deployments after upgrade from PSMDB operator 1.14.0 to 1.16.1

Performance degradation in large deployments after upgrade from PSMDB operator 1.14.0 to 1.16.1

Open MatzeScandio opened this issue 5 months ago • 4 comments

Report

Performance degradation in large deployments after upgrade from PSMDB operator 1.14.0 to 1.16.1

we tested the upgrade in our DEV environment and did not see any issues with performance
After upgrading the operator in our PROD environment we noticed a significant slowdown
the PROD environment is significantly larger with ~90 PSMDBs and a retention period of 30 days resulting in 2700 psmdb-backup objects
while the creation of a new database in version 1.14.0 took about 5 minutes it took ~6 hours to create a new psmdb database with operator version 1.16.1__

Steps to reproduce

create 5 databases, enable backups for each and create a backup task named 'daily' and set the keep attribute to something above 0
monitor the kubernetes API calls for psmdb-backup resources
for each reconcile call of the psmdb object there should be 5 requests to the API: /apis/psmdb.percona.com/v1/namespaces/mongodb/perconaservermongodbbackups?labelSelector=ancestor%3Ddaily%2Ccluster%3D<db-name>

With just 5 databases and a limited number of backups this will of course not result in a slowdown, but you will be able to see the repeated calls to the API endpoint.

Alternatively

create 5 databases, enable backups for each and use unique names this time. set the keep attribute to something above 0
in the logs you should see a lot of 'deleting outdated backup job' events (4 log lines per psmdb reconcile call)

Versions

Kubernetes: AWS-EKS 1.24
Operator: 1.16.1
Database: 5.0.23-20

Anything else?

feel free to ask in case of any unclarities

Aug 30 '24 09:08 MatzeScandio

percona-server-mongodb-operator
percona-server-mongodb-operator copied to clipboard

Performance degradation in large deployments after upgrade from PSMDB operator 1.14.0 to 1.16.1

Report

More about the problem

Analysis

Workaround

Steps to reproduce

Alternatively

Versions

Anything else?

percona-server-mongodb-operator percona-server-mongodb-operator copied to clipboard

Performance degradation in large deployments after upgrade from PSMDB operator 1.14.0 to 1.16.1

Report

More about the problem

Analysis

Workaround

Steps to reproduce

Alternatively

Versions

Anything else?

percona-server-mongodb-operator
percona-server-mongodb-operator copied to clipboard