[Feature]: expose gpu model name as resource
Suggestion Description
Hello AMD team,
We would like to request an enhancement to the device plugin's StrategyMixed feature to allow GPU resources to be exposed using their model names (e.g., amd.com/mi250x, amd.com/mi210x) instead of the generic amd.com/gpu.
We’re aware of the existing resource_naming_strategy option, but it currently exposes partition types rather than the actual GPU model names.
Is there any plan to extend the plugin to support this level of resource granularity? FYI, Nvida device plugin has the same capability.
If this aligns with your roadmap, we’d be happy to contribute to help implement it.
Thanks for your help.
Operating System
No response
GPU
No response
ROCm Component
Device plugin
Good issue.
Good issue +1
Hi @baddoub , thanks for raising this feature request. This is a good idea for the cluster that has a mixture of different GPU models.
Currently we don't have this functionality because:
- most of the customers are using
amd.com/gpuand some of them occasionally need GPU partitioning specific name - many ecosystem partner or projects are by default looking at the
amd.com/gpuas GPU resource name for their integration.
PR is welcome to enable this feature, I'd suggest keep amd.com/gpu as default resource name, and users could optionally use resource_naming_strategy to turn on the feature you requested.