Marco Pracucci
Marco Pracucci
An error connecting to memcached or an i/o timeout doesn't compromise the correct functioning of Cortex but can have a negative impact on query performances. That being said, temporarily and...
> `"expanding series: failed to get store-gateway replication set owning the block 01F4AB90AE4ZS92FN8T8RMT5JW: at least 1 healthy replica required, could only find 0"` This error means that there are not...
> a little over 38,000 -- I run compactors with 5 replicas. 0 reports 38,000 and 4 reports 380. ith the default config, we compact blocks up to 24h so,...
> Looks like compactions were failing for our largest tenant, causing a slew of problems. With regards to this issue, is there any open issue left or can we close...
@bboreham What's the sentiment if we would change https://github.com/weaveworks/common/blob/bd288de53d57de300fa286688ce2fc935687213f/httpgrpc/server/server.go#L67-L69 to return an error also in case of a 4xx?
> pod-0 is marked as unhealthy, because it can't talk to pod-2 Why is pod-0 marked as unhealthy? I can't understand this.
> > Why is pod-0 marked as unhealthy? I can't understand this. > > I'm not sure. Looking at the code, the `/ready` state is supposed to latch. My observations...
One theory is that the higher latency is given by the fact that we have to wait for the slowest ingester (because of the quorum) if there already 1 unhealthy...
> I'm assuming that corruption, in this case, means we've failed to list/download the rule group. The listing is triggered by `r.listRules()`. Rule groups content is not downloaded and/or decoded...