[AUTOCUT] Gradle Check Flaky Test Report for IndexShardTests

Open opensearch-ci-bot opened this issue 1 year ago • 1 comments

Flaky Test Report for IndexShardTests

Noticed that IndexShardTests has flaky tests that failed during post-merge actions.

Details

| Git Reference | Merged Pull Request | Build Details | Test Name |
|---|---|---|---|
| 17e8935863fa8e3dfc0e248b5ad315d8e8b7266d | 13844 | 39312 | org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore |
| 1bdf43dcf786f7e19855e1709ec80d0ae038b48c | 13866 | 39441 | org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore |
| 679ccac846b36bd060018b81fd52059adec8001d | 14248 | 40881 | org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore |

The other pull requests, besides those involved in post-merge actions, that contain failing tests with the IndexShardTests class are:

For more details on the failed tests, refer to the OpenSearch Gradle Check Metrics dashboard.

opensearch-ci-bot, Jun 13 '24 21:06

Fixed with https://github.com/opensearch-project/OpenSearch/pull/14418

sachinpkale, Jun 21 '24 07:06

[Storage Triage - attendees 1 2 3 4 5 6 7 8]

@sachinpkale Looks like the issue was not fixed! Please feel free to raise a PR to address it!

linuxpi, Jul 25 '24 15:07

Tried reproducing this failure by running the test for 2k+ iterations, but it did not occur. I also could not find any stack trace, since all the build links have expired.

ashking94, Jan 27 '25 04:01

REPRODUCE WITH: ./gradlew ':server:test' --tests "org.opensearch.index.shard.IndexShardTests.testRestoreSearchOnlyShardFromStore" -Dtests.seed=90940F9D9F354EDC -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=ee-TG -Dtests.timezone=Asia/Ashgabat -Druntime.java=21

IndexShardTests > testRestoreSearchOnlyShardFromStore FAILED
    org.apache.lucene.index.CorruptIndexException: misplaced codec footer (file truncated?): length=0 but footerLength==16 (resource=metadata__9223372036854775746__9223372036854775804__9223372036854775802__9223372036854775805__-1039764442__9223370290998210694__2)
        at __randomizedtesting.SeedInfo.seed([90940F9D9F354EDC:C6CDC876E014858E]:0)
        at app//org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:616)
        at app//org.opensearch.common.io.VersionedCodecStreamWrapper.readStream(VersionedCodecStreamWrapper.java:64)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.readMetadataFile(RemoteSegmentStoreDirectory.java:258)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.readLatestMetadataFile(RemoteSegmentStoreDirectory.java:247)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.init(RemoteSegmentStoreDirectory.java:154)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.<init>(RemoteSegmentStoreDirectory.java:141)
        at app//org.opensearch.index.shard.IndexShardTestCase.createRemoteSegmentStoreDirectory(IndexShardTestCase.java:814)
        at app//org.opensearch.index.shard.IndexShardTestCase.createRemoteStore(IndexShardTestCase.java:802)
        at app//org.opensearch.index.shard.IndexShardTestCase.newShard(IndexShardTestCase.java:672)
        at app//org.opensearch.index.shard.IndexShardTestCase.newShard(IndexShardTestCase.java:528)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:892)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:863)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:858)
        at app//org.opensearch.index.shard.IndexShardTests.testRestoreSearchOnlyShardFromStore(IndexShardTests.java:3056)

@mch2 @vinaykpud Is this perhaps related to the reader/writer separation work?

andrross, May 12 '25 16:05