cortex icon indicating copy to clipboard operation
cortex copied to clipboard

Top flaky tests on `master` by frequency

Open dsabsay opened this issue 8 months ago • 3 comments

The following table lists the tests that have failed on the master branch over the last couple months. These tests can be assumed to be flaky.

This data was generated by a periodic script that parses Action run logs and extracts the tests that failed. Links to the logs in GitHub Actions is included but log retention is short (I think 1 month) so will not be available for all instances.

The data below includes action runs since 2025-02-17. Last updated 2025-06-19.


Fixed tests:

  • TestNativeHistogramFuzz (thanks @SungJin1212)
  • TestSchedulerShutdown_FrontendLoop (thanks @SungJin1212)
  • TestRecoverAlertsPostOutage
Name Number of Failures Links
TestPrometheusCompatibilityQueryFuzz 16 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
TestQuerierWithBlocksStorageRunningInSingleBinaryMode 14 0 1 2 3 4 5 6 7 8 9 10 11 12 13
TestNativeHistogramFuzz 12 0 1 2 3 4 5 6 7 8 9 10 11
TestRing_ShuffleShardWithLookback_CorrectnessWithFuzzy 12 0 1 2 3 4 5 6 7 8 9 10 11
TestExperimentalPromQLFuncsWithPrometheus 10 0 1 2 3 4 5 6 7 8 9
TestCompactor_ShouldCompactOnlyUsersOwnedByTheInstanceOnShardingEnabledAndMultipleInstancesRunning 9 0 1 2 3 4 5 6 7 8
TestParquetFuzz 6 0 1 2 3 4 5
TestMinimizeSpreadTokenGenerator 6 0 1 2 3 4 5
TestBlocksCleaner_ShouldRemoveBlocksOutsideRetentionPeriod 4 0 1 2 3
TestBackwardCompatibilityQueryFuzz 3 0 1 2
TestStoreGatewayLazyExpandedPostingsSeriesFuzz 3 0 1 2
TestRecoverAlertsPostOutage 3 0 1 2
TestQuerierWithBlocksStorageRunningInMicroservicesMode 3 0 1 2
TestExpandedPostingsCacheFuzz 3 0 1 2
TestQueryFrontendResponseSizeLimit 3 0 1 2
TestDisableChunkTrimmingFuzz 2 0 1
TestLoader_ShouldCacheIndexNotFoundOnBackgroundUpdates 2 0 1
TestPartitionCompactor_ShouldFailCompactionOnTimeout 2 0 1
TestPushRace 2 0 1
TestCompactor_RingLifecyclerShouldAutoForgetUnhealthyInstances 2 0 1

dsabsay avatar Apr 26 '25 02:04 dsabsay

@yeya24 It needs to be reopened since many of the flaky tests remain.

SungJin1212 avatar May 14 '25 04:05 SungJin1212

Logs from TestRing_ShuffleShardWithLookback_CorrectnessWithFuzzy have expired. Do we still see it occurring?

CharlieTLe avatar Jun 19 '25 19:06 CharlieTLe

@CharlieTLe The last time that one failed was 3 months ago it looks like. I just updated the table with the latest data.

dsabsay avatar Jun 20 '25 03:06 dsabsay