sui icon indicating copy to clipboard operation
sui copied to clipboard

Investigate time spent after restart in sui / narwhal

Open mwtian opened this issue 2 years ago • 1 comments

Steps to Reproduce Issue

When a sui-node has ran for sometime, after restart narwhal in the node does not immediately start catching up on certificates. e.g. Screenshot 2022-11-20 at 10 22 47 AM

This makes the narwhal catch up after restart much slower than it should. We need to eliminate the inefficiencies here.

Expected Result

After a node restarts, narwhal soon starts catching up.

Actual Result

After a node restarts, it seems to be stuck for >20 min before starting to catch up.

mwtian avatar Nov 20 '22 18:11 mwtian

CPU profile after a node restarts before it started to catch up: perf-dwarf Only ~110 samples were taken in 10s, so the profile may not be accurate numerically. But we can look into some of the notable usages and check if they are on the critical path of node restart, and how they block narwhal catch up.

mwtian avatar Nov 20 '22 18:11 mwtian

@mwtian I assume this is not relevant anymore, so closing it.

stefan-mysten avatar Dec 27 '23 01:12 stefan-mysten