Abhijat Malviya

Results 166 comments of Abhijat Malviya

https://buildkite.com/redpanda/redpanda/builds/25904#018726d9-2297-467c-b6c2-9b512ff330d1

https://buildkite.com/redpanda/redpanda/builds/26146#01873657-4218-4a7b-b7cc-84f8c84e2c9f

failed on arm nightly CDT run: https://buildkite.com/redpanda/vtools/builds/5295#0185c159-c9c2-4ea8-9a27-107193993aa2

* MaintenanceTest.test_maintenance.use_rpk=False (2/4 runs)
* MaintenanceTest.test_maintenance.use_rpk=True (2/4 runs)
* MaintenanceTest.test_maintenance_sticky.use_rpk=False (2/4 runs)
* MaintenanceTest.test_maintenance_sticky.use_rpk=True (2/4 runs)

Looking at the logs, it appears that this issue happens after a leadership transfer.

* Node A performs spillover, removes entries from manifest up to and including offset 2140.
* Node...

This offset gap appears to be due to an anomaly inserted by the test: ``` [INFO - 2024-01-12 20:23:42,074 - cloud_storage_scrubber_test - _assert_segment_metadata_anomalies - lineno:510]: Removing segment with meta {'base_offset': 2141,...

Since the spillover meta is created by iterating over individual segments of the manifest, it starts at offset 2212, but the validation works with the manifest's start offset field which...
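A rough sketch of how the two views can disagree, assuming the manifest's start offset is 2141 (one past the last offset removed by the earlier spillover) and that the first surviving segment begins at 2212 once the test removes the segment with base offset 2141. The `Segment` and `Manifest` types, the function names, and the committed offsets are hypothetical and are not Redpanda's actual code:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    base_offset: int
    committed_offset: int  # hypothetical values, for illustration only


@dataclass
class Manifest:
    start_offset: int  # the field the validation is described as using
    segments: List[Segment]


def spillover_base_offset(manifest: Manifest) -> int:
    """Base offset as seen when iterating over the individual segments."""
    return min(s.base_offset for s in manifest.segments)


def offset_gap(manifest: Manifest) -> int:
    """Difference between the segment-derived base and the start offset field."""
    return spillover_base_offset(manifest) - manifest.start_offset


# Assumed state after the test removes the segment with base_offset 2141.
manifest = Manifest(
    start_offset=2141,
    segments=[Segment(2212, 2300), Segment(2301, 2400)],
)

print(spillover_base_offset(manifest))  # segment-derived start
print(offset_gap(manifest))             # non-zero means the two views disagree
```

With these assumed values, the segment-derived start is 2212 while the start offset field still says 2141, so any check that compares the two would see a gap.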

reported stale by pandatriage, closing.

```
| 763 | 13841 | 143 | 3 | 2023-10-09 | CloudArchiveRetentionTest.test_delete | CI Failure (segments not deleted) in `CloudArchiveRetentionTest.test_delete` |
```

Ran this on repeat in CI and in the debug build I did see this get blocked after one test failed: ``` t=1701077829694116: /var/lib/buildkite-agent/builds/buildkite-amd64-builders-i-03788b59bfc51b527-1/redpanda/vtools/src/v/archival/tests/ntp_archiver_test.cc(1780): fatal error: in "test_manifest_spillover": critical check...