redpanda icon indicating copy to clipboard operation
redpanda copied to clipboard

CI Failure (key symptom) in `OMBValidationTest.test_max_partitions`

Open vbotbuildovich opened this issue 1 year ago • 2 comments

https://buildkite.com/redpanda/vtools/builds/13711

Module: rptest.redpanda_cloud_tests.omb_validation_test
Class: OMBValidationTest
Method: test_max_partitions
test_id:    OMBValidationTest.test_max_partitions
status:     FAIL
run time:   787.923 seconds

AssertionError("['Metric aggregatedEndToEndLatency99pct, value 131.356, Expected to be <= 120.0, check failed.', 'Metric aggregatedEndToEndLatency999pct, value 310.797, Expected to be <= 200.0, check failed.']")
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/home/ubuntu/redpanda/tests/rptest/services/cluster.py", line 103, in wrapped
    r = f(self, *args, **kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/redpanda_cloud_tests/omb_validation_test.py", line 458, in test_max_partitions
    benchmark.check_succeed()
  File "/home/ubuntu/redpanda/tests/rptest/services/openmessaging_benchmark.py", line 384, in check_succeed
    OMBSampleConfigurations.validate_metrics(self._metrics,
  File "/home/ubuntu/redpanda/tests/rptest/services/openmessaging_benchmark_configs.py", line 123, in validate_metrics
    assert is_valid, str(results)
AssertionError: ['Metric aggregatedEndToEndLatency99pct, value 131.356, Expected to be <= 120.0, check failed.', 'Metric aggregatedEndToEndLatency999pct, value 310.797, Expected to be <= 200.0, check failed.']

JIRA Link: CORE-2959

vbotbuildovich avatar May 14 '24 21:05 vbotbuildovich

There are 2 readings in a row above allowed:

170.727, 370.477,

"endToEndLatency99pct" : [ 43.147, 51.325, 47.856, 46.15, 42.678, 44.578, 45.264, 43.281, 46.71, 45.613, 44.348, 45.829, 44.663, 44.317, 48.659, 45.526, 170.727, 370.477, 43.558, 49.542, 49.821, 47.037, 51.814, 46.226, 44.755, 47.846, 44.819, 44.503, 42.481, 48.834 ],

209.434, 456.771

"endToEndLatency999pct" : [ 54.919, 157.728, 61.438, 59.856, 56.24, 55.506, 58.216, 57.755, 67.128, 132.129, 56.255, 58.183, 57.662, 53.804, 59.724, 56.227, 209.434, 456.771, 54.495, 64.736, 64.122, 122.681, 77.082, 60.931, 55.851, 60.86, 66.105, 60.409, 52.206, 62.648 ],

rpdevmp avatar May 15 '24 00:05 rpdevmp

*https://buildkite.com/redpanda/vtools/builds/13741

vbotbuildovich avatar May 15 '24 21:05 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/14516

vbotbuildovich avatar Jun 07 '24 21:06 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/14618 *https://buildkite.com/redpanda/vtools/builds/14622

vbotbuildovich avatar Jun 12 '24 21:06 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/14862

vbotbuildovich avatar Jun 20 '24 03:06 vbotbuildovich

@rpdevmp - but are they disk spikes? If they fit the pattern of temporary blips in disk performance then we should still treat it as a spike, e.g., maybe relax the rule about only 1 in a row.

travisdowns avatar Jun 23 '24 04:06 travisdowns

*https://buildkite.com/redpanda/vtools/builds/15506 *https://buildkite.com/redpanda/vtools/builds/15544

vbotbuildovich avatar Jul 15 '24 09:07 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/15673

vbotbuildovich avatar Jul 17 '24 21:07 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/15781

vbotbuildovich avatar Jul 19 '24 21:07 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/15886

vbotbuildovich avatar Jul 23 '24 21:07 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/16111

vbotbuildovich avatar Aug 02 '24 21:08 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/16378

vbotbuildovich avatar Aug 09 '24 21:08 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/16404

vbotbuildovich avatar Aug 10 '24 21:08 vbotbuildovich

Closing older-bot-filed CI issues as we transition to a more reliable system.

piyushredpanda avatar Sep 24 '24 03:09 piyushredpanda