amdsmi
[Feature]: check NUMA auto-balancing flag
Suggestion Description
NUMA auto-balancing is an important setting that affects the performance of GPU software. It would be useful to check this setting and warn users if it is enabled, since the docs linked below recommend disabling it. Related docs:
- https://rocm.docs.amd.com/en/latest/how-to/rocm-for-ai/inference-optimization/workload.html#disable-numa-auto-balancing
- https://rocm.docs.amd.com/projects/rccl/en/develop/how-to/troubleshooting-rccl.html#collect-the-rccl-microbenchmark-data
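A minimal sketch of what such a check could look like, assuming the setting is read from the standard Linux procfs entry `/proc/sys/kernel/numa_balancing` (`0` means disabled, non-zero means enabled) and that the desired state is disabled, as the linked docs recommend. The function names `read_numa_balancing` and `numa_balancing_warning` are hypothetical and are not part of the amdsmi API:

```python
from pathlib import Path

# Standard Linux procfs location of the NUMA auto-balancing setting.
NUMA_BALANCING_PATH = Path("/proc/sys/kernel/numa_balancing")


def read_numa_balancing(path=NUMA_BALANCING_PATH):
    """Return the integer value of the setting, or None if it cannot be read.

    None covers kernels built without NUMA balancing support, non-Linux
    systems, and unexpected file contents.
    """
    try:
        return int(Path(path).read_text().strip())
    except (OSError, ValueError):
        return None


def numa_balancing_warning(value):
    """Return a warning string if auto-balancing is enabled, else None.

    Hypothetical helper: treats any non-zero value as enabled.
    """
    if value is None or value == 0:
        return None
    return (
        f"NUMA auto-balancing is enabled (numa_balancing={value}); "
        "this may reduce GPU workload performance. "
        "To disable it, write 0 to /proc/sys/kernel/numa_balancing as root."
    )


if __name__ == "__main__":
    message = numa_balancing_warning(read_numa_balancing())
    if message:
        print(f"WARNING: {message}")
```

Returning `None` when the file is missing keeps the check silent on systems where the setting does not exist, rather than raising a false alarm.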
Operating System
No response
GPU
No response
ROCm Component
No response