GPU support for ARO - Batch 1

Open sakthi-vetrivel opened this issue 4 years ago • 12 comments

Today, ARO does not support GPU-enabled worker nodes for an ARO cluster. This issue tracks support for them as ARO worker nodes.

NC4as T4 v3, NC8as T4 v3, NC16as T4 v3, NC64as T4 v3

sakthi-vetrivel avatar Jul 27 '20 18:07 sakthi-vetrivel

Is there any movement on this? We are seeing a lot of demand for this feature. Thanks.

waynedovey avatar Apr 21 '21 05:04 waynedovey

Is there an ETA on when this will be in public preview?

thegovind avatar Jun 21 '21 19:06 thegovind

Any news on this?

philipp1992 avatar Sep 06 '21 12:09 philipp1992

Any updates?

thomasphall avatar Oct 19 '21 18:10 thomasphall

Any update on this?

supernovae avatar Nov 29 '21 13:11 supernovae

We are treating this request as two milestones. The first milestone will require more configuration steps on the customer's part and is currently targeted for Q1 2022. The second milestone will offer a better experience, and we will share an ETA once we get closer.

rahulm23 avatar Dec 06 '21 19:12 rahulm23

Any updates?

msftnadavbh avatar Apr 14 '22 19:04 msftnadavbh

Just use AKS.

philipp1992 avatar Apr 15 '22 05:04 philipp1992

If you require this capability in ARO, please reach out to me, @waynedovey, @git2g, @philipp1992, @supernovae, @msftnadavbh. Thank you.

redhatstuart avatar Apr 28 '22 02:04 redhatstuart

Yes, we do. I guess that's the reason people come to this issue.

philipp1992 avatar Apr 28 '22 05:04 philipp1992

I wrote up a guide to implementing GPU on ARO - https://mobb.ninja/docs/aro/gpu/ - Documentation is in progress.

supernovae avatar Jun 27 '22 15:06 supernovae

The instance types planned for support are the following: NC4as T4 v3, NC8as T4 v3, NC16as T4 v3, NC64as T4 v3

jboutaud avatar Jul 15 '22 20:07 jboutaud

This is now officially supported. Here is a reference to the article explaining how to use GPUs: https://docs.microsoft.com/en-us/azure/openshift/howto-gpu-workloads

jboutaud avatar Sep 08 '22 18:09 jboutaud

These T4 GPUs may not be suitable for heavy-duty training workloads.

We require Tesla V100 or Nvidia A10 GPU support and are eagerly awaiting #276.

maulik-modi22 avatar Jun 16 '23 12:06 maulik-modi22