cassandra-medusa
cassandra-medusa copied to clipboard
Support (S3, GCP, Azure) storage classes
Medusa does not support specifying the storage class name when uploading backups to S3/GCP/Azure. This is very important for many customers as it can help to reduce the storage cost.
Closes #568
Quality Gate passed
Issues
0 New issues
0 Accepted issues
Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code
hi everyone, any help needed here? this feature is very interesting and would like to try using it ASAP
Hi, We tried the following steps to add the storage_class. We are using AWS S3 bucket for storing the backup files.
- Upgraded the Medusa version to 0.21.0 from 0.17.2.
- Added the storage_class parameter to the medusa.ini file.
- We updated config.py, abstract_storage.py, s3_base_storage.py files accordingly.
- Ran differential backup.
The backup was successful. But it is taking 1 hour to complete. The previous backups would finish within 2-5minutes. We observed that the manifest.json file is taking more time. Can you please let us know what might be the issue?
I've implemented the suggested changes and added integration tests over at https://github.com/thelastpickle/cassandra-medusa/pull/777/checks
Hi,
Below are the tests done on the cluster:
Old Medusa Version: 0.17.2 New Medusa Version: 0.21.0 Storage_class: STANDARD_IA
Test1 (New Medusa version): New bucket, storage_class parameter in the medusa.ini file Started backup. The 2nd backup took the same time as the first backup around 50 minutes.
Test2 (old Medusa version): New bucket, storage_class parameter in the medusa.ini file Started backup. The backup was successful, 1st backup took 50 minutes to complete. The 2nd backup was done within 1minute.
Test3 (old Medusa version): New bucket, storage_class parameter in the medusa.ini file Modified config.py, abstract_storage.py, s3_base_storage.py Started backup. The backup was successful, 1st backup took 6 minutes to complete. The 2nd backup was done within 1minute. (There are few backups in the bucket while taking this backup.)
Test4 (old Medusa version): New bucket, storage_class parameter in the medusa.ini file Modified config.py, abstract_storage.py, s3_base_storage.py Started backup with the empty new bucket. The backup was successful, 1st backup took 50mins to complete. The 2nd backup was done within 1minute.
Can you please let us know why the New Medusa version with STANDARD_IA is taking more time?
Thanks, Kanthi Rekha.