
node locked up during serving of txhashset to syncing peer

Open antiochp opened this issue 4 years ago • 5 comments

The node appears to lock up while serving a txhashset archive to a syncing peer. The txhashset "response" is a small message with an "attachment" containing the bytes of txhashset.zip. My understanding is that this is served from a temporary file, so there should be no reason to lock the db or the MMR backend for the duration of the transfer. We do potentially lock the peers map, though?

Not sure if this is a recent regression or if this has actually been happening for a while and we just never noticed.

[Screenshot: 2020-02-26 at 10:06:35 AM]

Probably related - https://github.com/mimblewimble/grin/issues/3218

antiochp avatar Feb 26 '20 10:02 antiochp

Looking at the "longest chain" info at the top of the TUI, we do appear to be staying in sync, which is interesting. The peers list is not being updated correctly (note the last seen times of 1846s etc.). This suggests it is the peers list that is getting locked up.
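To illustrate the kind of lock scope that could produce this symptom, here is a minimal sketch using only the Rust standard library. It is not grin's actual code: `Peer`, `PeerMap`, `serve_holding_lock`, `serve_after_releasing_lock` and `writable_during_send` are all hypothetical names, and the "slow send" is just a closure standing in for streaming the attachment. The first function keeps the read guard on the peers map alive for the whole send, so nothing can take the write lock in the meantime. The second clones the `Arc` handle and drops the guard before sending.

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

/// Hypothetical stand-in for a peer connection (not grin's real `Peer` type).
pub struct Peer {
    pub addr: String,
}

/// Hypothetical shared peers map, guarded by a single read/write lock.
pub type PeerMap = RwLock<HashMap<String, Arc<Peer>>>;

/// Suspected pattern: the read guard stays alive for the entire slow send,
/// so any writer (for example one updating "last seen") is blocked until it ends.
pub fn serve_holding_lock<F: FnOnce(&Peer)>(peers: &PeerMap, addr: &str, slow_send: F) {
    let guard = peers.read().unwrap();
    if let Some(peer) = guard.get(addr) {
        slow_send(peer); // map is still read-locked here
    }
}

/// Alternative: clone the peer handle, release the lock, then do the slow send.
pub fn serve_after_releasing_lock<F: FnOnce(&Peer)>(peers: &PeerMap, addr: &str, slow_send: F) {
    let guard = peers.read().unwrap();
    let peer = guard.get(addr).cloned();
    drop(guard); // release the read lock before the slow send
    if let Some(peer) = peer {
        slow_send(&peer);
    }
}

/// Reports whether the peers map could be write-locked while the send is in progress.
/// The "send" is a probe closure that only tries to take the write lock.
pub fn writable_during_send(hold_lock: bool) -> bool {
    let addr = "peer-a"; // hypothetical peer key
    let mut map = HashMap::new();
    map.insert(addr.to_string(), Arc::new(Peer { addr: addr.to_string() }));
    let peers: PeerMap = RwLock::new(map);

    let mut writable = false;
    // `try_write` never blocks; it returns an error if the lock is currently held.
    let probe = |_peer: &Peer| {
        writable = peers.try_write().is_ok();
    };
    if hold_lock {
        serve_holding_lock(&peers, addr, probe);
    } else {
        serve_after_releasing_lock(&peers, addr, probe);
    }
    writable
}
```

With this sketch, `writable_during_send(true)` reports that the map cannot be write-locked during the send, while `writable_during_send(false)` reports that it can. Whether the real node holds the peers lock in this way would still need to be confirmed against the actual code.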

antiochp avatar Feb 26 '20 10:02 antiochp

Confirmed the node does get back into a stable state once the txhashset response has completed.

antiochp avatar Feb 26 '20 10:02 antiochp

[Screenshot: 2020-02-26 at 10:17:29 AM]

antiochp avatar Feb 26 '20 10:02 antiochp

When is the fix expected?

githubusrR avatar Apr 15 '20 10:04 githubusrR