Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU support #289

Open
enxebre opened this issue Mar 13, 2020 · 13 comments · May be fixed by #1341
Open

GPU support #289

enxebre opened this issue Mar 13, 2020 · 13 comments · May be fixed by #1341
Labels
kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness.

Comments

@enxebre
Copy link
Member

enxebre commented Mar 13, 2020

/kind feature

Describe the solution you'd like
As a user I'd like to declarative define which GPU I want for my instances

Anything else you would like to add:
https://cloud.google.com/compute/docs/gpus/add-gpus#create-gpu-instance-api

We'd need to expose this in the machineInfra API and do the plumbing with GCP API through:

// GuestAccelerators: A list of the type and count of accelerator cards
// attached to the instance.
GuestAccelerators []*AcceleratorConfig `json:"guestAccelerators,omitempty"`

https://raw.githubusercontent.com/googleapis/google-api-go-client/master/compute/v1/compute-gen.go

@k8s-ci-robot k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Mar 13, 2020
@fejta-bot
Copy link

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 11, 2020
@detiber
Copy link
Member

detiber commented Jun 11, 2020

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jun 11, 2020
@dilyevsky
Copy link
Contributor

I would like to claim that feature - need gpu support for GCP provider in our clusters.

@cpanato
Copy link
Member

cpanato commented Jul 22, 2021

go for it @dilyevsky 🎉

@SamuelStuchly
Copy link

@dilyevsky Any progress on this feature ? If not I could take it, since we have already implemented this for Openshift at openshift/cluster-api-provider-gcp#172

@sayantani11
Copy link
Contributor

@SamuelStuchly You can do the necessary changes.... And that will be the best, and then I can proceed with the documentation for gpu-support😄

@kernelpanic77
Copy link

Hello @cpanato @dims @richardcase!
I am Ishan from India. I am currently working in the computer vision domain, particularly in Medical-AI. Along with Distributed Systems, I also have knowledge of machine learning. I found the Kubernetes-sigs project extremely interesting and wanted to contribute to adding GPU support as Part of LFX for Summer 2022. I am currently going through the documentation to get a better understanding of the codebase. The LFX website shows that mentee applications for this mentorship are closed. Please let me know if I can still submit my application for the project.

Looking forward to contributing to this amazing project.

@reyvonger
Copy link

any updates?

@nicolas2bonfils
Copy link

nicolas2bonfils commented Aug 23, 2024

Hello,

3 related PR were closed (#529, #655 and #654) due to inactivity.
What can we do to move forward on GPU support ? Are people interested by working on it (I wiling to participate) ?
Maybe openshift version of this provider can help ?

Regards

@nicolas2bonfils
Copy link

nicolas2bonfils commented Aug 23, 2024

@SamuelStuchly do you think that your code in the Openshift version can also be used for this provider ?

@gecube
Copy link

gecube commented Aug 26, 2024

A very important and necessary thing

@nicolas2bonfils
Copy link

I look at the code and it's very different from the version in the openshift PR. A big work is ahead of us ! 💪

@reyvonger
Copy link

bump

@jwmay2012 jwmay2012 linked a pull request Oct 22, 2024 that will close this issue
3 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness.
Projects
None yet