cluster-api-provider-gcp

GPU support

Open enxebre opened this issue 5 years ago • 13 comments

/kind feature

Describe the solution you'd like: As a user, I'd like to declaratively define which GPUs I want for my instances.

Anything else you would like to add: https://cloud.google.com/compute/docs/gpus/add-gpus#create-gpu-instance-api

We'd need to expose this in the machineInfra API and plumb it through to the GCP API via:

// GuestAccelerators: A list of the type and count of accelerator cards
// attached to the instance.
GuestAccelerators []*AcceleratorConfig `json:"guestAccelerators,omitempty"`

https://raw.githubusercontent.com/googleapis/google-api-go-client/master/compute/v1/compute-gen.go
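A minimal sketch of what that plumbing might look like, not the provider's actual implementation. `GPUSpec` and `toGuestAccelerators` are hypothetical names invented for illustration. `AcceleratorConfig` here is a local stand-in that mirrors the two fields of the type in `compute/v1`, so the snippet compiles without the client library. The `projects/.../zones/.../acceleratorTypes/...` form of the accelerator type is assumed from the GCP docs linked above.

```go
package main

import "fmt"

// GPUSpec is a HYPOTHETICAL user-facing field for the machine API;
// it does not exist in the project and is used only for illustration.
type GPUSpec struct {
	Type  string // accelerator type name, e.g. "nvidia-tesla-t4"
	Count int64  // number of cards to attach
}

// AcceleratorConfig is a local stand-in mirroring the fields of
// AcceleratorConfig in google.golang.org/api/compute/v1.
type AcceleratorConfig struct {
	AcceleratorCount int64  `json:"acceleratorCount,omitempty"`
	AcceleratorType  string `json:"acceleratorType,omitempty"`
}

// toGuestAccelerators converts the declarative spec into the list that
// would be assigned to the instance's GuestAccelerators field.
func toGuestAccelerators(project, zone string, gpus []GPUSpec) ([]*AcceleratorConfig, error) {
	out := make([]*AcceleratorConfig, 0, len(gpus))
	for _, g := range gpus {
		if g.Type == "" || g.Count < 1 {
			return nil, fmt.Errorf("invalid GPU spec %+v: type and a positive count are required", g)
		}
		out = append(out, &AcceleratorConfig{
			AcceleratorCount: g.Count,
			// Assumed URL form for the accelerator type resource.
			AcceleratorType: fmt.Sprintf("projects/%s/zones/%s/acceleratorTypes/%s", project, zone, g.Type),
		})
	}
	return out, nil
}

func main() {
	// Placeholder project and zone values.
	accs, err := toGuestAccelerators("my-project", "us-central1-a",
		[]GPUSpec{{Type: "nvidia-tesla-t4", Count: 1}})
	if err != nil {
		panic(err)
	}
	for _, a := range accs {
		fmt.Printf("%d x %s\n", a.AcceleratorCount, a.AcceleratorType)
	}
}
```

Validating the spec before building the request keeps obviously bad input (empty type, zero count) from reaching the GCP API; a real implementation would likely also need to set the instance's host maintenance policy, which this sketch leaves out.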

enxebre avatar Mar 13 '20 13:03 enxebre

Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta. /lifecycle stale

fejta-bot avatar Jun 11 '20 14:06 fejta-bot

/lifecycle frozen

detiber avatar Jun 11 '20 14:06 detiber

I would like to claim this feature; we need GPU support for the GCP provider in our clusters.

dilyevsky avatar Jul 21 '21 20:07 dilyevsky

go for it @dilyevsky 🎉

cpanato avatar Jul 22 '21 11:07 cpanato

@dilyevsky Any progress on this feature? If not, I could take it, since we have already implemented this for OpenShift at https://github.com/openshift/cluster-api-provider-gcp/pull/172

SamuelStuchly avatar Nov 11 '21 15:11 SamuelStuchly

@SamuelStuchly You can make the necessary changes; that would be best. Then I can proceed with the documentation for GPU support 😄

sayantani11 avatar Dec 15 '21 14:12 sayantani11

Hello @cpanato @dims @richardcase! I am Ishan from India. I currently work in the computer vision domain, particularly in medical AI. Along with distributed systems, I also have knowledge of machine learning. I found the kubernetes-sigs project extremely interesting and wanted to contribute to adding GPU support as part of LFX for Summer 2022. I am currently going through the documentation to get a better understanding of the codebase. The LFX website shows that mentee applications for this mentorship are closed. Please let me know if I can still submit my application for the project.

Looking forward to contributing to this amazing project.

kernelpanic77 avatar May 26 '22 17:05 kernelpanic77

Any updates?

reyvonger avatar Jul 17 '23 12:07 reyvonger