k8s-device-plugin
k8s-device-plugin copied to clipboard
Is there any way in the meantime to request more than 1 replica from each GPU in my node
I have started MPS and used 10 as the division factor, but in our application scenario, we might directly allocate 2 whole GPUs, which is equivalent to specifying nvidia.com/gpu: 20. If I set nvidia.com/gpu > 1, I encounter the error: ‘request for “nvidia.com/gpu”: invalid request: maximum request size for shared resources is 1; found 10, which is unexpected’.
Is there any way in the meantime to request more than 1 replica from each GPU in my node?