John Spray
John Spray
## Cover letter Updates to ManyPartitionsTest: - Make it dynamically select size based on enviroment: this is useful for developers but also for running on different instance types without having...
FAIL test: PartitionMoveInterruption.test_cancelling_partition_move_x_core.replication_factor=3.unclean_abort=True.recovery=restart_recovery (1/84 runs) failure at 2022-07-23T08:22:18.320Z: in job https://buildkite.com/redpanda/redpanda/builds/12971#018229ae-5aed-4afd-9a59-3d4331bfdc09 ``` Protocol violation: request version rpc::transport_version::v1 incompatible with reply version rpc::transport_version::v2 ``` While this is "only" a BadLogLines, it's...
FAIL test: ConsumerOffsetsMigrationTest.test_migrating_consume_offsets.failures=True.cpus=3 (1/51 runs) failure at 2022-07-05T19:22:17.160Z: TimeoutError("Consumer failed to consume up to offsets {TopicPartition(topic='topic-kxuttrpvwb', partition=4): 96572, TopicPartition(topic='topic-kxuttrpvwb', partition=1): 33266, TopicPartition(topic='topic-kxuttrpvwb', partition=2): 97585, TopicPartition(topic='topic-kxuttrpvwb', partition=5): 136721, TopicPartition(topic='topic-kxuttrpvwb', partition=3): 79302,...
This looks like a fairly consistent failure two nights in a row: FAIL test: TestMirrorMakerService.test_consumer_group_mirroring.source_type=kafka (2/2 runs) failure at 2022-08-01T10:40:48.978Z: TimeoutError('') in job https://buildkite.com/redpanda/vtools/builds/3077#01825803-32b3-4fee-8e12-243a0177833c failure at 2022-07-31T11:04:20.402Z: TimeoutError('') in job...
FAIL test: AWSRoleFetchTests.test_write (2/2 runs) failure at 2022-08-01T10:40:48.978Z: TimeoutError('Segments were not removed') in job https://buildkite.com/redpanda/vtools/builds/3077#01825803-32b3-4fee-8e12-243a0177833c failure at 2022-07-31T11:04:20.402Z: TimeoutError('Segments were not removed') in job https://buildkite.com/redpanda/vtools/builds/3073#018252da-dd82-4dac-bad9-ea37369ded6e FAIL test: STSRoleFetchTests.test_write (2/2 runs)...
## Cover letter admin: read-after-write consistency for config status on leader node Previously, after writing a config update, API clients could do a /status query to the same node and...
## Cover letter This is to narrow down a spooky failure seen when the consumer is apparently not waiting for a high enough offset, due to seeing a lower offset...
Bunch of tests affected by this in the last 24h, all running in clustered ducktape: FAIL test: OpenBenchmarkTest.test_multiple_topics_omb (1/2 runs) failure at 2022-07-07T10:51:23.073Z: in job https://buildkite.com/redpanda/vtools/builds/2806#0181d77e-2fe1-4594-876a-062eb75b50c0 FAIL test: ManyClientsTest.test_many_clients (1/2...
## Cover letter Describe in plain language the motivation (bug, feature, etc.) behind the change in this PR and how the included commits address it. Fixes #ISSUE-NUMBER, Fixes #ISSUE-NUMBER, ......
## Background On a system in a steady state, the current `describe` commands work fine. But if a partition is going through changes, rpk will tend to get NOT_LEADER errors...