feature: Support for remote cache #209

Closed · 2 of 4 tasks
ArthurSens opened this issue Apr 4, 2023 · 13 comments
Labels: enhancement (New feature or request)
Milestone: v1.0.0

Comments

@ArthurSens

Checklist:

  • I've searched for similar issues and couldn't find anything matching
  • I've discussed this feature in the #k8sgpt slack channel

Is this feature request related to a problem?

  • Yes
  • No

Describe the solution you'd like

It would be nice if k8sgpt had the option to cache remotely.

Benefits for the project and its users

When multiple people are responsible for the same cluster or clusters (e.g. employees of the same company), a shared remote cache can reduce cost and latency, since an analysis already performed by one person becomes a cache hit for everyone else.

Potential drawbacks

Additional context
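Purely for illustration, the snippet below sketches the read-through pattern this request implies; every name in it is hypothetical and not taken from k8sgpt's actual code.

```go
package cache

// Hypothetical sketch: the names below are illustrative, not k8sgpt's real API.

// Cache is any store that can answer "have we already asked the AI about this
// exact analysis?" and remember the answer.
type Cache interface {
	Load(key string) (value string, found bool, err error)
	Store(key, value string) error
}

// Explain returns a cached AI explanation when one exists, and only calls the
// (expensive, slow) AI backend on a miss. When the Cache is remote and shared,
// a miss paid for by one engineer becomes a hit for everyone else.
func Explain(c Cache, key string, callAI func() (string, error)) (string, error) {
	if v, ok, err := c.Load(key); err == nil && ok {
		return v, nil // shared hit: no AI cost, no AI latency
	}
	v, err := callAI()
	if err != nil {
		return "", err
	}
	// Best-effort write-back; a failed Store should not fail the analysis.
	_ = c.Store(key, v)
	return v, nil
}
```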

@AlexsJones
Member

I thought about this during implementation and agree in principle.
There are some considerations, such as locking the cache during updates, that make it a little hard to coordinate from distributed clients; this is where a SaaS could be quite useful.

Will think on it some more and add it to the project backlog, thank you!

@AlexsJones added the enhancement label Apr 4, 2023
@matthisholleville
Contributor

I would like to propose a first implementation. Have you started anything on this, @AlexsJones?

@AlexsJones
Member

I haven't started anything; can you outline what you're thinking?

@matthisholleville
Contributor

I think this issue needs to be split into two feature requests:

  • Adding a lock file (similar to Terraform's state lock file) during analysis.
  • Adding remote storage functionality to store configuration files in S3 buckets, for example (a rough backend sketch follows below).

What do you think?
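To make the second item concrete, here is a minimal sketch of what a pluggable remote backend could look like, with S3 as one implementation. The interface and type names are hypothetical, and the S3 part assumes the official aws-sdk-go-v2 client:

```go
package cache

import (
	"bytes"
	"context"
	"io"

	"github.com/aws/aws-sdk-go-v2/aws"
	"github.com/aws/aws-sdk-go-v2/service/s3"
)

// RemoteBackend is what any remote store (S3, GCS, git, ...) would implement.
// Names are hypothetical, not existing k8sgpt code.
type RemoteBackend interface {
	Get(ctx context.Context, key string) ([]byte, error)
	Put(ctx context.Context, key string, data []byte) error
}

// S3Backend stores cache entries as objects in a single bucket.
type S3Backend struct {
	Client *s3.Client
	Bucket string
}

func (b *S3Backend) Get(ctx context.Context, key string) ([]byte, error) {
	out, err := b.Client.GetObject(ctx, &s3.GetObjectInput{
		Bucket: aws.String(b.Bucket),
		Key:    aws.String(key),
	})
	if err != nil {
		return nil, err
	}
	defer out.Body.Close()
	return io.ReadAll(out.Body)
}

func (b *S3Backend) Put(ctx context.Context, key string, data []byte) error {
	_, err := b.Client.PutObject(ctx, &s3.PutObjectInput{
		Bucket: aws.String(b.Bucket),
		Key:    aws.String(key),
		Body:   bytes.NewReader(data),
	})
	return err
}
```

A GCS or git implementation would satisfy the same two methods, which keeps the analyzer code backend-agnostic.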

@AlexsJones
Member

I think S3 can be configured to do the locking for you?

@matthisholleville
Contributor

Yes, I think we could use this solution: https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lock.html. But it does not work the same way across all remote backends (GCS, git, etc.). Shouldn't we use a generic solution for all backends, like Terraform does?
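For discussion, a backend-agnostic lock in the spirit of Terraform's state locking could be as small as the sketch below; all of the names are made up:

```go
package cache

import (
	"context"
	"time"
)

// Locker is the backend-agnostic piece: S3 Object Lock, a GCS generation
// precondition, or a plain lock file in a git repo could each sit behind it.
type Locker interface {
	// Lock blocks (or fails) until this client holds the lock. The returned
	// LockInfo identifies the holder, mirroring Terraform's lock metadata.
	Lock(ctx context.Context, key string) (LockInfo, error)
	Unlock(ctx context.Context, info LockInfo) error
}

// LockInfo records who took the lock and when, so a stale lock can be
// identified and, if necessary, force-released.
type LockInfo struct {
	ID      string
	Who     string
	Created time.Time
}
```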

@ArthurSens
Author

Looking at it as a user story, I believe Terraform and k8sgpt are used in different ways that might require different approaches here.

From my point of view at least, Terraform is often used during CI for resource management, and the lock makes sense there because it genuinely prevents a possible outage.

Now pardon my lack of knowledge about k8sgpt internals, but what is the worst case if the cache doesn't have a lock during updates 😅? It shouldn't be able to actually make changes to a cluster, so although a corrupted cache is not the best UX, it can't cause outages, right?

@matthisholleville
Contributor

Thank you for your answer; it's a good question to ask. From my understanding, we need a lock or similar mechanism to ensure that the cache is not corrupted when analyses run in parallel.
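As one deliberately simple illustration of that guard for a local cache directory, an O_EXCL lock file serializes writers; a remote backend would need its own atomic primitive behind the same idea. The helper and file names here are hypothetical, standard library only:

```go
package cache

import (
	"errors"
	"fmt"
	"os"
	"time"
)

// withCacheLock runs update() only while holding an exclusive lock file.
// O_CREATE|O_EXCL makes creation atomic: exactly one process wins, the others
// retry, so two parallel analyses never write the cache at the same time.
func withCacheLock(lockPath string, update func() error) error {
	for i := 0; i < 50; i++ {
		f, err := os.OpenFile(lockPath, os.O_CREATE|os.O_EXCL|os.O_WRONLY, 0o644)
		if err == nil {
			defer os.Remove(lockPath) // release the lock on return
			defer f.Close()
			return update()
		}
		if !errors.Is(err, os.ErrExist) {
			return err
		}
		time.Sleep(200 * time.Millisecond) // someone else holds the lock
	}
	return fmt.Errorf("timed out waiting for %s", lockPath)
}
```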

@AlexsJones
Member

I am doing some work on designing this and will update here when I have something we can discuss further.

@xavpaice
Contributor

xavpaice commented May 3, 2023

Not sure if it's helpful, but you might want to look at https://github.com/replicatedhq/troubleshoot, which has a support-bundle command that grabs a bunch of info from a running k8s cluster and puts it into a tarball, and https://github.com/replicatedhq/sbctl, which has a utility capable of running a k8s API against that tarball. I tried it with k8sgpt just now and there are definitely some gaps: we need to collect more info and fix some things in sbctl that explode when k8sgpt queries it, but that's doable.

The short story is, you can grab a totally disconnected tarball from a cluster, ship it somewhere, and then serve it up and run kubectl commands against it without having to connect to the live cluster. Handy if you're doing disconnected support work. I'd love to get k8sgpt to read those bundles.
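Not prescribing anything, but it may help to see how little a client-go consumer (which is what k8sgpt is) cares whether the API server is a live cluster or a bundle served locally by sbctl; the snippet below simply honors whatever kubeconfig it is pointed at (assuming sbctl has been set up to serve the bundle per its docs):

```go
package main

import (
	"context"
	"fmt"
	"os"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// KUBECONFIG can point at a real cluster or at the local API server that
	// sbctl exposes for a support bundle; client-go cannot tell the difference.
	cfg, err := clientcmd.BuildConfigFromFlags("", os.Getenv("KUBECONFIG"))
	if err != nil {
		panic(err)
	}
	clientset, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}
	pods, err := clientset.CoreV1().Pods("").List(context.TODO(), metav1.ListOptions{})
	if err != nil {
		panic(err)
	}
	fmt.Printf("saw %d pods without touching a live cluster\n", len(pods.Items))
}
```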

@AlexsJones
Member

> Not sure if it's helpful, but you might want to look at https://github.com/replicatedhq/troubleshoot ... and https://github.com/replicatedhq/sbctl ... Handy if you're doing disconnected support work. I'd love to get k8sgpt to read those bundles.

I think the support bundle is an interesting idea; we do it in microk8s and it's really useful. I think there are two interesting requirements for this work:

  • something that can be uploaded and shared
  • something that can represent enough of the cluster data to be useful for someone else running k8sgpt against it.

That second item I think is worthy of its own design discussion to take it further.

@AlexsJones added this to the v1.0.0 milestone May 12, 2023
@AlexsJones
Member

Related to #381

@AlexsJones mentioned this issue May 18, 2023
@AlexsJones
Member

Remote caching will be supported as of v0.3.3, thanks for the suggestion ❤️
