atlasdb
Transactional Distributed Database Layer
[PDS-117310] KvTableMappingService.updateTableMap is spammed from TS threads when a table is deleted
```
Count: 12595 "com.palantir.logsafe.exceptions.SafeRuntimeException: I exist to show you the stack trace
    at com.palantir.atlasdb.keyvalue.impl.KvTableMappingService.lambda$updateTableMap$0(KvTableMappingService.java:95)
    at java.util.concurrent.atomic.AtomicReference.updateAndGet(AtomicReference.java:179)
    at com.palantir.atlasdb.keyvalue.impl.KvTableMappingService.updateTableMap(KvTableMappingService.java:95)
    at com.palantir.atlasdb.keyvalue.impl.KvTableMappingService.getMappedTableRef(KvTableMappingService.java:184)
    at com.palantir.atlasdb.keyvalue.impl.KvTableMappingService.getMappedTableName(KvTableMappingService.java:175)
    at com.palantir.atlasdb.keyvalue.impl.TableRemappingKeyValueService.deleteAllTimestamps(TableRemappingKeyValueService.java:133)
    at com.palantir.atlasdb.keyvalue.impl.TableSplittingKeyValueService.deleteAllTimestamps(TableSplittingKeyValueService.java:149)
    at
```
Jeremy...
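One pattern that produces this kind of spam is calling `AtomicReference.updateAndGet` on every read, which re-runs the update function (and here, logs) even when nothing changed. A minimal sketch of the check-before-swap alternative, with class and method names invented for illustration (this is not AtlasDB's actual API):

```java
import java.util.Map;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical sketch: read the current value first and only pay for the
// compare-and-swap (and any attendant logging) when the map actually changed,
// keeping the hot read path off the CAS loop.
public class TableMapCache {
    private final AtomicReference<Map<String, String>> tableMap =
            new AtomicReference<>(Map.of());

    public Map<String, String> get() {
        return tableMap.get();
    }

    public void update(Map<String, String> latest) {
        Map<String, String> current = tableMap.get();
        // Cheap read-side check; skip the swap entirely when nothing changed.
        // A concurrent racing update may win the CAS, which is fine for a
        // last-writer-wins cache like this sketch.
        if (!current.equals(latest)) {
            tableMap.compareAndSet(current, latest);
        }
    }
}
```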
Have you considered using [metric-schema](https://github.com/palantir/metric-schema) here? A large internal project would really appreciate the generated documentation! _Originally posted by @carterkozak in https://github.com/palantir/atlasdb/pull/4719#issuecomment-615411862_
```
@Test
public void wat() {
    RangeRequest r1 = RangeRequest.builder()
            .startRowInclusive(PtBytes.toBytes("tom"))
            .endRowExclusive(PtBytes.toBytes("zzzz"))
            .retainColumns(ImmutableList.of(PtBytes.toBytes("name")))
            .build();
    RangeRequest r2 = RangeRequest.builder()
            .startRowInclusive(PtBytes.toBytes("tom"))
            .endRowExclusive(PtBytes.toBytes("zzzz"))
            .retainColumns(ImmutableList.of(PtBytes.toBytes("name")))
            .build();
    assertThat(r1).isEqualTo(r2); // passes
    assertThat(r1.hashCode()).isEqualTo(r2.hashCode()); // :(
    assertThat(ImmutableSet.of(r1, r2).size()).isEqualTo(1);...
```
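The test above exercises the `equals`/`hashCode` contract: objects that are equal must report equal hash codes, or hash-based collections silently stop deduplicating them. A minimal, self-contained illustration (not AtlasDB code; the `Range` class is invented for this example):

```java
import java.util.HashSet;
import java.util.Set;

// Deliberately buggy value class: equals() is overridden but hashCode() is
// not, so the default identity hash almost always differs between instances
// and equal objects land in different HashSet buckets.
class Range {
    final String start;

    Range(String start) {
        this.start = start;
    }

    @Override
    public boolean equals(Object o) {
        return o instanceof Range && ((Range) o).start.equals(start);
    }
    // The fix would be: @Override public int hashCode() { return start.hashCode(); }
}

public class HashCodeContract {
    static int distinctCount() {
        Set<Range> set = new HashSet<>();
        set.add(new Range("tom"));
        set.add(new Range("tom"));
        // Two entries survive despite being equal(); with a value-based
        // hashCode() the set would collapse them to one.
        return set.size();
    }

    public static void main(String[] args) {
        System.out.println(distinctCount());
    }
}
```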
https://github.com/palantir/atlasdb/blob/develop/atlasdb-cassandra/src/main/java/com/palantir/atlasdb/keyvalue/cassandra/CassandraVerifier.java#L213 The `CassandraVerifier` waits for schema agreement, which is generally a smart thing to do when creating a keyspace. However, in the case of a Cassandra 3 upgrade, the schemas...
Maybe not required if this comes for free with Conjure changes, but worth investigating.
Linked to #4598: we want to track how long we spend retrying, and alert when we retry for over 10s, as this was our old limit. Related to PDS-111849.
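One way to track total retry time is to wrap the retrying call in a wall-clock timer and flag when it crosses the 10s limit. A hedged sketch with invented names (not the actual AtlasDB retry machinery):

```java
import java.time.Duration;
import java.util.function.Supplier;

// Hypothetical sketch: times the whole retry loop and reports when the total
// exceeds the historical 10s limit; a real implementation would emit a metric
// here rather than print to stderr.
public class TimedRetrier {
    static final Duration ALERT_THRESHOLD = Duration.ofSeconds(10);

    public static <T> T runWithRetries(Supplier<T> task, int maxAttempts) {
        long start = System.nanoTime();
        try {
            RuntimeException last = null;
            for (int attempt = 1; attempt <= maxAttempts; attempt++) {
                try {
                    return task.get();
                } catch (RuntimeException e) {
                    last = e; // retry until attempts are exhausted
                }
            }
            throw last;
        } finally {
            Duration elapsed = Duration.ofNanos(System.nanoTime() - start);
            if (elapsed.compareTo(ALERT_THRESHOLD) > 0) {
                System.err.println("Retried for " + elapsed
                        + ", exceeding limit " + ALERT_THRESHOLD);
            }
        }
    }
}
```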
See internal issue PDS-111356 for context. An internal product that utilises multiple `TransactionManager` instances reported failures talking to two Cassandra nodes. On further investigation, these failures were generated when polling...
See internal issue PDS-111266. During performance degradation of a single Cassandra node, AtlasDB today does a poor job of determining which node is bad and blacklisting it from future...
Internal reference PDS-109404. Should a node be slow for whatever reason (in this instance, disk latencies are up), requests become slow, e.g. `getFreshTimestamps` takes 500ms to over a second at...
Cluster migrations here are of the form A->D, B->E, C->F.