onload AF_XDP bricks aws ec2
Environment
- OS: RHEL 9
- Kernel: 5.14.0-70
- Instance Type: m5zn.metal
- Network Driver: ENA
- Commit: f8458aa
- Branch: master (as of this writing)
Issue Description
I am attempting to install OpenOnload configured for XDP on the ENA driver. The setup process runs smoothly until I test the application, at which point my SSH connection is disrupted, and the instance becomes unresponsive.
Steps to Reproduce
- Install all required packages.
- Execute
make. - Run
./onload_install --build-profile cloudas per the documentation. - Successfully load sfc modules with
sudo onload_tool reload. - Attach a secondary ENA interface (name: eth1, driver: ena) to my EC2 instance.
- Register the new ENA interface with sfc_resource:
echo eth1 > /sys/module/sfc_resource/afxdp/register. - Disable SELinux and run my test application:
onload python3 client.py [server_ip] [port]. - Post-step 7, my SSH connection is disrupted, and I cannot reconnect to the instance.
Additional Information
I cannot post the system logs as the instance becomes unresponsive. I referenced this previous discussion where experiments with OpenOnload on ENA via XDP showed mixed results. However this post is two years out of date which is why I'm oppening a new issue.
I am relatively new to OpenOnload, so any advice on potential missteps or additional configurations would be greatly appreciated.
Thank you for any assistance you can provide.
You ever find a resolution to this @lparkersc ? I just hit the same issue ☠️
I hit the same issue.
- kernel version : 6.8.0-1024-aws