docs-csm icon indicating copy to clipboard operation
docs-csm copied to clipboard

Add warm arpcache for System Recovery rapid_rebuild/basecamp

Open kimjensen-hpe opened this issue 8 months ago • 0 comments

Description

When rebuilding the NCNs from the System Recovery pit-disk. The basecamp call to get metadata will fail with 500 error due to arp cache issues. The arp cache needs to be pre-populated.

This was testing in UK Met TDSY system

root@ncn-m001 2024-05-24 16:22:28 /usr/share/doc/csm/scripts/operations/system_recovery # ./rapid_rebuild.sh
...
Chassis Power Control: Down/Off
Adding arp cache entry for ncn-w004 ip=100.112.129.4 mac=14:02:ec:dd:40:a8
Adding arp cache entry for ncn-w003 ip=100.112.129.5 mac=14:02:ec:db:c7:08
Adding arp cache entry for ncn-w002 ip=100.112.129.6 mac=14:02:ec:df:a4:d8
Adding arp cache entry for ncn-w001 ip=100.112.129.7 mac=14:02:ec:e1:c7:08
Adding arp cache entry for ncn-s003 ip=100.112.129.8 mac=b4:7a:f1:7a:3f:ac
Adding arp cache entry for ncn-s002 ip=100.112.129.9 mac=b4:7a:f1:7a:3e:68
Adding arp cache entry for ncn-s001 ip=100.112.129.10 mac=b4:7a:f1:7a:40:ba
Adding arp cache entry for ncn-m003 ip=100.112.129.11 mac=b4:7a:f1:79:d1:00
Adding arp cache entry for ncn-m002 ip=100.112.129.12 mac=b4:7a:f1:7a:3d:6c
Adding arp cache entry for ncn-m001 ip=100.112.129.13 mac=14:02:ec:df:aa:78
...

Checklist

  • [ ] If I added any command snippets, the steps they belong to follow the prompt conventions (see example).
  • [ ] If I added a new directory, I also updated .github/CODEOWNERS with the corresponding team in Cray-HPE.
  • [ ] My commits or Pull-Request Title contain my JIRA information, or I do not have a JIRA.

kimjensen-hpe avatar May 24 '24 22:05 kimjensen-hpe