das: Parallelise `catchUp` method
Supplements https://github.com/celestiaorg/celestia-node/pull/473 by additionally parallelising the DASing of past headers. See the TODOs mentioned there.

Now that DASState is almost implemented, we should decide how errors from parallel catchUp workers are stored when this issue is tackled.
#870 introduces a lock into CacheAvailability, which can create performance-degrading lock contention; we should keep this in mind while implementing this issue.
Copy of the comment on the PR:

> By contention point TODO I meant specifically this place. Basically, the routine hitting the autobatching threshold will lock all the other writers in the future parallelized DASer until the batch is synced to disk. This fundamentally degrades the parallelization of the DASer. The solution I know of is to decouple reads and writes, as in header.Store.
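To illustrate the decoupling the comment describes, here is a hedged sketch of the header.Store-style pattern: writers append results to an in-memory batch under a short-lived lock, and a single background goroutine performs the slow, synced write. All names here (`resultStore`, `flushThreshold`, etc.) are hypothetical and only demonstrate the pattern, not the actual celestia-node API.

```go
package main

import (
	"fmt"
	"sync"
)

// resultStore accepts sampling results without blocking callers on disk I/O:
// writers append to an in-memory pending batch, while a single background
// goroutine flushes batches once they reach flushThreshold.
type resultStore struct {
	mu      sync.Mutex
	pending []uint64 // sampled heights awaiting flush

	flushCh chan []uint64 // batches handed off to the flusher
	done    chan struct{}

	flushThreshold int
	flushed        [][]uint64 // stands in for the on-disk state
}

func newResultStore(threshold int) *resultStore {
	rs := &resultStore{
		flushCh:        make(chan []uint64, 8),
		done:           make(chan struct{}),
		flushThreshold: threshold,
	}
	go rs.flushLoop()
	return rs
}

// Put records a sampled height. It only holds the in-memory lock briefly,
// so concurrent workers never wait on a disk sync.
func (rs *resultStore) Put(height uint64) {
	rs.mu.Lock()
	rs.pending = append(rs.pending, height)
	if len(rs.pending) >= rs.flushThreshold {
		batch := rs.pending
		rs.pending = nil
		rs.mu.Unlock()
		rs.flushCh <- batch // hand off; the flusher does the slow write
		return
	}
	rs.mu.Unlock()
}

func (rs *resultStore) flushLoop() {
	for batch := range rs.flushCh {
		// The slow, synced write happens here, off the writers' hot path.
		rs.flushed = append(rs.flushed, batch)
	}
	close(rs.done)
}

// Close flushes any remainder and stops the background goroutine.
// Callers must ensure no Put is in flight before calling Close.
func (rs *resultStore) Close() {
	rs.mu.Lock()
	if len(rs.pending) > 0 {
		rs.flushCh <- rs.pending
		rs.pending = nil
	}
	rs.mu.Unlock()
	close(rs.flushCh)
	<-rs.done
}

func main() {
	rs := newResultStore(4)
	var wg sync.WaitGroup
	for w := 0; w < 4; w++ {
		wg.Add(1)
		go func(base uint64) {
			defer wg.Done()
			for h := base; h < base+10; h++ {
				rs.Put(h)
			}
		}(uint64(w * 10))
	}
	wg.Wait()
	rs.Close()

	total := 0
	for _, b := range rs.flushed {
		total += len(b)
	}
	fmt.Println("flushed heights:", total)
}
```

With this shape, contention is limited to the in-memory append; only the flusher goroutine ever touches the (simulated) disk.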
Idea for parallelisation — DASState will contain:
1. sampleState (state of the sampling routine)
2. an array of worker catchUp routine states
```go
type State struct {
	sampleStateLk sync.Mutex
	sampleState   RoutineState

	workerStates []WorkerState
}

type WorkerState struct {
	ID                uint64
	From, To          uint64
	LastSampledHeight uint64
	Err               SampleErr
}
```
Ref #848
We should revisit https://github.com/celestiaorg/celestia-node/issues/504 and close it if, after parallelisation is implemented, the issue is no longer reproducible.