skypilot icon indicating copy to clipboard operation
skypilot copied to clipboard

[k8s] Show currently available GPUs on Kubernetes cluster

Open romilbhardwaj opened this issue 1 year ago • 4 comments

User requested - in addition to showing list of supported GPUs, they would like to see realtime utilization for GPUs so they can know how many GPUs they can launch. Kubernetes dashboard is nice, but doesn't show live GPU usage.

romilbhardwaj avatar Dec 05 '23 01:12 romilbhardwaj

following

gmittal avatar Dec 05 '23 06:12 gmittal

Potentially misunderstanding this here - does the user mean like 5/8 gpus are used on the cluster (where 5 gpus are occupied by other pods)?

In this case, it can be done by dynamically fetching node.status.allocatable_capacity in kubectl get nodes for the nvidia.com/gpu field.

michaelzhiluo avatar Dec 15 '23 23:12 michaelzhiluo

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.

github-actions[bot] avatar Apr 14 '24 02:04 github-actions[bot]

Bumping this up

romilbhardwaj avatar Apr 22 '24 02:04 romilbhardwaj