thanos icon indicating copy to clipboard operation
thanos copied to clipboard

Zone aware replication

Open brancz opened this issue 4 years ago • 9 comments
trafficstars

Is your proposal related to a problem?

Allow not just replication factor and quorum to be a deciding factor, but also that this quorum must be across availability zones.

Describe the solution you'd like

Make Thanos aware of the availability zones receive nodes are in, and allow quorum to pass if and only if ceil(replication-factor/2) availability zones were successfully replicated to.

Describe alternatives you've considered

Leaving as is, or not using Thanos. Leaving as is does not tolerate an entire zone outage though, which is not included in most cloud provider's SLOs (as in cloud providers say, that there is nothing wrong with an entire availability zone being unavailable if the rest works).

@thanos-io/thanos-maintainers

brancz avatar Feb 22 '21 13:02 brancz

We could immediately benefit from it in our setup. We have been planning to support multi-cluster deployments.

kakkoyun avatar Feb 22 '21 14:02 kakkoyun

Hello 👋 Looks like there was no activity on this issue for the last two months. Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗 If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale[bot] avatar Jun 03 '21 02:06 stale[bot]

Still valid.

kakkoyun avatar Jun 03 '21 09:06 kakkoyun

Hello 👋 Looks like there was no activity on this issue for the last two months. Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗 If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale[bot] avatar Aug 02 '21 18:08 stale[bot]

Hello 👋 Looks like there was no activity on this issue for the last two months. Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗 If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale[bot] avatar Oct 11 '21 06:10 stale[bot]

Closing for now as promised, let us know if you need this to be reopened! 🤗

stale[bot] avatar Oct 30 '21 17:10 stale[bot]

Hello 👋 Looks like there was no activity on this issue for the last two months. Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗 If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale[bot] avatar Mar 02 '22 16:03 stale[bot]

Closing for now as promised, let us know if you need this to be reopened! 🤗

stale[bot] avatar Apr 17 '22 06:04 stale[bot]

Hello 👋 Looks like there was no activity on this issue for the last two months. Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗 If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale[bot] avatar Sep 21 '22 06:09 stale[bot]

+1, multi-az is a highly available requirement for many companies

singularitatem avatar Jun 15 '23 05:06 singularitatem

This has been implemented already, I think: https://thanos.io/tip/components/receive.md/#az-aware-ketama-hashring-experimental.

GiedriusS avatar Jun 15 '23 06:06 GiedriusS