Drop accelerator metrics and nvidia implementation #3206

liggitt · 2022-12-01T16:21:25Z

This capability is unused as of Kubernetes 1.25+ (see kubernetes/kubernetes#114204 (comment) and https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/1867-disable-accelerator-usage-metrics)

That means nvidiaManager was always a no-op implementation in Kubernetes 1.25+

This drops the nvidia implementation, collector, and AcceleratorUsageMetrics stats type, which also allows dropping the github.com/mindprince/gonvml dependency, which some license scanners were unhappy with (xref https://groups.google.com/g/kubernetes-sig-architecture/c/dsMVIdPPUK8/m/u3ZjJtcnBwAJ?utm_medium=email&utm_source=footer)

cc @dashpole @bobbypage

k8s-ci-robot · 2022-12-01T16:21:34Z

Hi @liggitt. Thanks for your PR.

I'm waiting for a google member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

bobbypage · 2022-12-01T17:48:21Z

/ok-to-test

bobbypage · 2022-12-01T18:00:08Z

Thanks @liggitt, I agree it makes sense that we can drop accelerator metrics after k8s has deprecated in-tree accelerator metrics and drop this dependency.

For cAdvisor users, we can recommend users to use https://github.com/NVIDIA/dcgm-exporter for accelerator metrics.

bobbypage · 2022-12-02T02:20:00Z

LGTM

liggitt · 2022-12-02T13:06:09Z

thanks, let's plan to tag cadvisor and bump in k/k early in the 1.27 cycle

bobbypage · 2022-12-02T20:17:43Z

Sounds good, will do!

The code was removed in google#3206.

k8s-ci-robot added the needs-ok-to-test label Dec 1, 2022

liggitt mentioned this pull request Dec 1, 2022

add github.com/mindprince/gonvml to unwanted dependencies kubernetes/kubernetes#114204

Merged

1 task

Drop accelerator metrics and nvidia integration

d91f2e6

liggitt force-pushed the drop-nvml branch from f041884 to d91f2e6 Compare December 1, 2022 16:23

k8s-ci-robot added ok-to-test and removed needs-ok-to-test labels Dec 1, 2022

pacoxu mentioned this pull request Dec 2, 2022

drop nvml from cadvisor and k/k #3205

Closed

bobbypage approved these changes Dec 2, 2022

View reviewed changes

bobbypage merged commit a52ec5d into google:master Dec 2, 2022

This was referenced Dec 2, 2022

WIP: Update cadvisor to 0.46.1 kubernetes/kubernetes#114254

Closed

License Scan report kubernetes/community#6992

Closed

eero-t mentioned this pull request Oct 3, 2023

Dont get GPU Metrics on Unraid #3374

Closed

bobrik added a commit to bobrik/cadvisor that referenced this pull request Jan 19, 2024

Remove mentions of accelerator from the docs

13df731

The code was removed in google#3206.

bobrik mentioned this pull request Jan 19, 2024

Remove mentions of accelerator from the docs #3458

Merged

rajivshah3 mentioned this pull request May 16, 2024

fix: Remove accelerator from disabled cadvisor metrics iotaledger/node-docker-setup#44

Merged

xigang mentioned this pull request Dec 9, 2024

AMD GPU metrics? #3625

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Drop accelerator metrics and nvidia implementation #3206

Drop accelerator metrics and nvidia implementation #3206

liggitt commented Dec 1, 2022

k8s-ci-robot commented Dec 1, 2022

bobbypage commented Dec 1, 2022

bobbypage commented Dec 1, 2022

bobbypage commented Dec 2, 2022

liggitt commented Dec 2, 2022

bobbypage commented Dec 2, 2022 •

edited

Loading

Drop accelerator metrics and nvidia implementation #3206

Drop accelerator metrics and nvidia implementation #3206

Conversation

liggitt commented Dec 1, 2022

k8s-ci-robot commented Dec 1, 2022

bobbypage commented Dec 1, 2022

bobbypage commented Dec 1, 2022

bobbypage commented Dec 2, 2022

liggitt commented Dec 2, 2022

bobbypage commented Dec 2, 2022 • edited Loading

bobbypage commented Dec 2, 2022 •

edited

Loading