atlasdb Revisit the logic for updating Cassandra nodes

Revisit the logic for updating Cassandra nodes

Open gmaretic opened this issue 6 years ago • 1 comments

Following up on PDS-94960, where we ended up unable to talk to Cassandra.

Since the updates happen only every 2 minutes, trying to talk to a single node that is not blacklisted, it is possible that we got unlucky and ended up in a state where all the old nodes were decommissioned before we were able to find out about any of the new ones.

Aug 12 '19 15:08 gmaretic

This is fishy:

    private List<TokenRange> getTokenRanges() throws Exception {
        return getRandomGoodHost().runWithPooledResource(CassandraUtils.getDescribeRing(config));
    }

Aug 12 '19 16:08 jeremyk-91

atlasdb atlasdb copied to clipboard

Revisit the logic for updating Cassandra nodes

atlasdb
atlasdb copied to clipboard