Marc Handalian

Results 41 comments of Marc Handalian

@Poojita-Raj So the failed tests in SegmentReplicationIndexShardTests are because we are using LuceneTestCase's `newFSDirectory`, which on close runs a [checkIndex](https://github.com/apache/lucene/blob/main/lucene/test-framework/src/java/org/apache/lucene/tests/store/MockDirectoryWrapper.java#L909) that will run various tests to identify corruption. We aren't...

I think we can actually look at disabling fsyncs entirely on replicas if we are triggering a commit/fsync when the shard closes. That would make this implementation much simpler by...

@Rishikesh1159 I don't think you need to change the visibility of the replication tracker. `SegmentReplicationTargetService` invokes `IndexShard#shouldProcessCheckpoint`, you can add a check there?

@tan31989 Thanks for your interest in this issue. > Hi, > > I am giving my thoughts on this issue. Ideally, feature flags should be allowed to toggle, but with...

I think 2 is the best option given we want this as a best effort. I also wouldn't be worried about it delaying shard promotion right now, we can set...

> I think the issue is that when we stop the replication for CCR, we close and reopen the index to reload the Engine. The index close is being handled...

I've added a commit to this ensuring cancelling primary allocation succeeds and that the replica is promoted & primary recreated as a replica. In testing that I found we were...

@dreamer-89 Yeah this should not be failing the replica, it would catch up to the new cp after the current replication event completes. I think this is happening bc we...

Have opened #4182 to cover moving this to allocationID over node. I have not been able to repro after applying this change but I think we should leave this open...