Bynar icon indicating copy to clipboard operation
Bynar copied to clipboard

Auto remediate bonding issue

Open rjsuresh opened this issue 6 years ago • 2 comments

We often see issues with interface goes down in a bonding configuration which can be simply brought up/fix by ifup/ifdown (60 to 70%) of the time.

  1. validation of OS
  2. check bonding cat /proc/net/bonding/bond1 (active / passive - based on the type of bonding configuration)
  3. ifdown
  4. ifup
  5. validate.

if not up,

file a ticket to do the following.

  1. check the link on the switch port, if not green engage network team. If that is not fixing,
  2. clean the SFP/reseat it, if not fixing
  3. change the SFP, if not fixing
  4. change the fiber, if not fixing
  5. Raise a Vendor case

rjsuresh avatar Feb 26 '19 18:02 rjsuresh

Yeah I think we can do that.

cholcombe973 avatar Feb 27 '19 17:02 cholcombe973

So there's a few ways i can see to handle this:

  • first would be using a library like this: https://github.com/andrew-d/interfaces-rs to get the interfaces on the machine, check if they're a bond and then check the up/down status and take appropriate action.
  • the second way would be to open the /proc/net/bonding directory and open each file in there and parse the output to see if they're up/down and take appropriate action.
    I generally prefer using library calls over parsing text

cholcombe973 avatar Mar 14 '19 16:03 cholcombe973