feat: Implement `Raft` consensus algorithm for distributed managers

Open rapsealk opened this issue 2 years ago • 1 comments

This PR is related to lablup/backend.ai#415.

rapsealk avatar Jul 10 '22 14:07 rapsealk

With the Raft algorithm, a cluster cannot make progress once a majority of managers are malfunctioning, because no candidate can gather the quorum needed to win an election. In our case, however, the cluster should keep performing leader election and log replication even under that condition. We should therefore consider a way to use both the quorum and majority methods. (For fault tolerance, a rank-based method may also be worth considering.)
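To make the quorum constraint concrete, here is a minimal sketch of the majority arithmetic that standard Raft relies on. The function names (`quorum_size`, `can_make_progress`, `tolerated_failures`) are hypothetical and are not taken from the Backend.AI codebase; the sketch only illustrates why a cluster stalls once a majority of managers is down.

```python
def quorum_size(cluster_size: int) -> int:
    """Smallest number of nodes that forms a strict majority."""
    if cluster_size < 1:
        raise ValueError("cluster_size must be positive")
    return cluster_size // 2 + 1


def can_make_progress(cluster_size: int, healthy: int) -> bool:
    """True if enough healthy nodes remain to elect a leader and commit entries."""
    return healthy >= quorum_size(cluster_size)


def tolerated_failures(cluster_size: int) -> int:
    """Maximum number of failed nodes under which a majority still exists."""
    return cluster_size - quorum_size(cluster_size)


if __name__ == "__main__":
    # A 5-manager cluster needs 3 votes, so it survives 2 failures but not 3.
    print(quorum_size(5), tolerated_failures(5))
    print(can_make_progress(5, healthy=3), can_make_progress(5, healthy=2))
```

Under these assumptions, a 5-manager cluster with only 2 healthy managers cannot elect a leader, which is the situation the comment above wants to handle with an additional mechanism.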

rapsealk avatar Jul 28 '22 00:07 rapsealk

Migrated to #697

rapsealk avatar Aug 31 '22 05:08 rapsealk