scylla-operator icon indicating copy to clipboard operation
scylla-operator copied to clipboard

Support persistent storage

Open tnozicka opened this issue 3 years ago • 3 comments

Is this a bug report or feature request?

  • Feature Request

What should the feature do:

  • Support PV backed by persistent storage
  • Add e2e variant for persistent storage

What is use case behind this feature: Some users don't always need the performance, e.g. for small or test clusters. Dedicated SSDs cost a lot and it requires additional setup to create the local static provisioner.

tnozicka avatar Jul 09 '21 15:07 tnozicka

We need to document that persistent storage won't work with hostNetworking because the pods won't stick to the nodes but they use their IPs for identity. This works only with local storage because PVs has a node affinity and pods will stick to it.

tnozicka avatar Aug 19 '21 05:08 tnozicka

Some noted that it may work if we use internal ips for membership, so it may just need verification.

tnozicka avatar Aug 19 '21 09:08 tnozicka

We are using the ClusterIP for identity so even with hostNetworking we wouldn't care about the Pod IP changing. I guess we just need some testing coverage.

tnozicka avatar Jan 26 '22 12:01 tnozicka

The Scylla Operator project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 30d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out

/lifecycle stale

/triage accepted let's add a test that chooses standard-rwo on GKE and runs our conformance suite

tnozicka avatar Jun 21 '24 11:06 tnozicka

let's add a test that chooses standard-rwo on GKE and runs our conformance suite

*-clusterip variant already runs on persistent disks (pd-ssd)

zimnx avatar Jun 21 '24 11:06 zimnx

To my knowledge none of our tests uses storageclass other then scylladb-local-xfs.

Are you talking about the disks used the VMs?

tnozicka avatar Jun 21 '24 12:06 tnozicka

I'm talking about disks used by scylladb-local-xfs. On parallel and serial jobs, scylladb-local-xfs is backed by localSSD, where on parallel-clusterip is backed by pd-ssd.

zimnx avatar Jun 21 '24 13:06 zimnx

I'm talking about disks used by scylladb-local-xfs. On parallel and serial jobs, scylladb-local-xfs is backed by localSSD, where on parallel-clusterip is backed by pd-ssd.

The point is it still uses our driver in that case. Having said that, what's the value in testing against a different storage class @tnozicka? We wouldn't encourage anyone to run with standard-rwo anyway, so why cover it?

rzetelskik avatar Jun 21 '24 13:06 rzetelskik

While we don't recommend it for performance reasons, people still use it. It's fine for instances that don't care about performance. The PVC lifecycle is a bit different and we should make sure we don't break conformance suite, nothing more.

tnozicka avatar Jun 24 '24 09:06 tnozicka