snarkOS icon indicating copy to clipboard operation
snarkOS copied to clipboard

Safe validator initial start

Open kyle-u410 opened this issue 1 year ago • 1 comments

🚀 Feature

Less down time when starting a validator.

Motivation

For network launch in testnet, starting too many validators at the same time has caused network halts. These halts are avoidable if the validator is synced when the bond_validator call is made.

Implementation options

  1. Running a node with start --validator in the case bond has not been called will run the node as client watching for bond_validator. As soon as that transaction is seen the node will shift to validator mode.
  2. Allow bond_public transaction to be sent from validator when it catches up to head. This is effectively a script that runs the validator as a client and then when the client is close to tip issues the bond_validator

kyle-u410 avatar Jun 12 '24 15:06 kyle-u410

In our testing, we start validators all at once all the time and we don't see network halts. What are the details of these halts?

damons avatar Jun 12 '24 18:06 damons

Thank you for the suggestion!

  1. would get quite clunky and potentially incur a large performance hit when implemented within snarkOS
  2. could be part of the solution, I would encourage coordination with ANF on such a script as part of the validator onboarding guide. It will also have to monitor whether the submitted transaction succeeded.

vicsn avatar Mar 10 '25 19:03 vicsn