This repository has been archived by the owner on Nov 1, 2022. It is now read-only.

Delete resources no longer in git #1442

Merged: 24 commits from feature/stack-tracking into fluxcd:master on Feb 27, 2019

Conversation

@Timer (Contributor) commented Oct 10, 2018

This pull request adds stack[1] tracking. Stack tracking attaches metadata to every object created by Flux so resources removed from the "Config Repo" can be pruned.

Every object created or updated by Flux is assigned a label carrying the stack's name, e.g. flux.weave.works/stack: default. It also gets a checksum that is an aggregate of all resources in that stack: flux.weave.works/stack-checksum: cf23df2207d99a74fbe169e3eba035e633b65d94.

After cluster changes have been applied, a full list of cluster resources is retrieved by the stack label. Any resource whose stack-checksum does not match the most recently applied checksum is pruned.
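As a rough illustration of that pruning rule (a sketch, not the code in this PR; the Resource type and IDs below are made up, only the label names come from the description above):

```go
package main

import "fmt"

const (
	stackLabel    = "flux.weave.works/stack"
	checksumLabel = "flux.weave.works/stack-checksum"
)

// Resource is a stand-in for a cluster object as the sync machinery sees it.
type Resource struct {
	ID     string            // e.g. "default:deployment/helloworld"
	Labels map[string]string // metadata.labels
}

// stale returns the flux-labelled resources whose stack-checksum no longer
// matches the checksum that was just applied, i.e. the candidates for pruning.
func stale(inCluster []Resource, appliedChecksum string) []Resource {
	var prune []Resource
	for _, r := range inCluster {
		if _, managed := r.Labels[stackLabel]; !managed {
			continue // not created by flux; leave it alone
		}
		if r.Labels[checksumLabel] != appliedChecksum {
			prune = append(prune, r)
		}
	}
	return prune
}

func main() {
	cluster := []Resource{
		{ID: "default:deployment/removed-from-git", Labels: map[string]string{stackLabel: "default", checksumLabel: "old"}},
		{ID: "default:deployment/still-in-git", Labels: map[string]string{stackLabel: "default", checksumLabel: "new"}},
		{ID: "kube-system:deployment/not-managed-by-flux"},
	}
	for _, r := range stale(cluster, "new") {
		fmt.Println("would prune:", r.ID)
	}
}
```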

Current Test Image

You can test this PR by using the following Docker image: timer/flux-with-stacks:feature-stack-tracking-1d481d0.
Outdated.

TODOs

  • Put behind a formal flag, e.g. --experimental-deletion
  • cluster.Cluster's Sync method should probably receive a more appropriate object than
    2x map[string]policy.Update
  • cluster.applyMetadata should be refactored and simplified
  • cluster.ExportByLabel should probably be given a few failure/retry cases to handle; what about when we're limited by RBAC?
  • Skipping resources (v1:ComponentStatus and v1:Endpoints) shouldn't happen at the listing level, but probably at the delete/diff level
  • Should sync.garbageCollect mark the object for deletion and only prune it next run? This could help with any race conditions in the K8s API server (I didn't run into any, but you never know).
  • Logger is broken in sync.Sync so fmt.Printf had to be used. We should figure this out.
  • Tests, tests, tests.
  • flux.weave.works/immortal

Post-merge follow-ups

  • Multi-stack support to minimize blast radius
  • Delete K8s resources in Background propagation mode

[1]: an arbitrary collection of k8s resources

@oliviabarrick (Contributor)

Can you fill out the description a little bit more to describe what this does? I’m not exactly sure what “stack tracking” means or what a “stack” is in this context.

@hiddeco (Member) commented Oct 11, 2018

@justinbarrick see #738 (comment) and #738 (comment)

@oliviabarrick (Contributor)

What is the purpose of the checksum? It seems to me that if we just added a label to the resources (any label really would work), you could still sweep for resources that have the label but do not match any defined resource in the repository based on type / namespace / name. AFAIK this is how Helm works.

I can see checksums being useful if we just want to know if we need to update a resource, but I would expect just using kubectl gives something similar to us already.

@Timer force-pushed the feature/stack-tracking branch 3 times, most recently from 8acf3da to 066c2da on October 11, 2018 at 23:50
@squaremo (Member)

What is the purpose of the checksum? It seems to me that if we just added a label to the resources (any label really would work), you could still sweep for resources that have the label but do not match any defined resource in the repository based on type / namespace / name. AFAIK this is how Helm works.

Yes, true. When you have access to the names of the things that should be there, all you need to know for deletion is whether it was created by flux in the first place. So the checksum is of use only if you are concerned about reapplying manifests unnecessarily.

However: we deliberately reapply manifests even if it looks unnecessary, to revert changes that have been made out-of-band. For example, if someone patches a resource with kubectl behind flux's back, we want to fairly promptly reassert the definition from the source of truth (i.e., git).

We still might want to report on things that look out of date, so I'm inclined to leave the checksums in, even if we don't use them in control flow.
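For illustration, a minimal sketch of using the checksum purely for drift reporting, as suggested here (the aggregation scheme, SHA-1 over resources sorted by ID, is an assumption, not necessarily what the PR does):

```go
package main

import (
	"crypto/sha1"
	"fmt"
	"sort"
)

// stackChecksum aggregates all manifests in a stack into one checksum.
// Sorting by resource ID keeps the result stable across file ordering.
func stackChecksum(manifests map[string][]byte) string {
	ids := make([]string, 0, len(manifests))
	for id := range manifests {
		ids = append(ids, id)
	}
	sort.Strings(ids)
	h := sha1.New()
	for _, id := range ids {
		h.Write([]byte(id))
		h.Write(manifests[id])
	}
	return fmt.Sprintf("%x", h.Sum(nil))
}

func main() {
	inGit := map[string][]byte{
		"default:deployment/helloworld": []byte("kind: Deployment\n..."),
	}
	expected := stackChecksum(inGit)
	observed := "cf23df2207d99a74fbe169e3eba035e633b65d94" // checksum label read from the cluster
	if observed != expected {
		fmt.Println("stack looks out of date; report it, without using the checksum for control flow")
	}
}
```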

@squaremo (Member) left a review:

I've done an initial scan over this, and made some comments on where things seem a bit out of place. It's all up for discussion :-)

Review threads (all outdated and resolved): cluster/kubernetes/kubernetes.go (two), sync/sync.go (two), cluster/cluster.go
@squaremo changed the title from "[WIP] Stack tracking" to "[WIP] Delete resources no longer in git" on Oct 19, 2018
@squaremo (Member)

  • Put behind a formal flag, e.g. --experimental-deletion

I've added a flag --sync-garbage-collection to switch on the deletion part of the syncing.

  • cluster.Cluster's Sync method should probably receive a more appropriate object than
    2x map[string]policy.Update

Resources are collected into named stacks and given a checksum before being passed to cluster.Sync. (The stacks and checksums are passed as part of the SyncDef argument).
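For readers following along, a guessed shape of what that could look like (the actual definitions live in cluster/cluster.go and may well differ):

```go
// Package cluster: a sketched shape for how stacks and checksums might travel
// with the sync definition; the real types in cluster/cluster.go may differ.
package cluster

// Resource is a placeholder for flux's resource type.
type Resource interface {
	ResourceID() string
	Bytes() []byte
}

// SyncStack names one stack, carries its aggregate checksum, and lists the
// resources that should exist for it once the sync has been applied.
type SyncStack struct {
	Name      string // value for the flux.weave.works/stack label
	Checksum  string // value for the flux.weave.works/stack-checksum label
	Resources []Resource
}

// SyncDef replaces the two maps of policy updates: it carries the stacks
// (alongside the existing apply actions) into cluster.Sync.
type SyncDef struct {
	Stacks []SyncStack
}
```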

  • cluster.applyMetadata should be refactored and simplified

I rewrote it to do the minimal amount of {de,en}coding.

  • cluster.ExportByLabel should probably be given a few failure/retry cases to handle; what about when we're limited by RBAC?

Good question; I've punted on that for the minute.

  • Skipping resources (v1:ComponentStatus and v1:Endpoints) shouldn't happen at the listing level, but probably at the delete/diff level

Anything that wasn't given a stack label in the first place will be left alone. There will need to be additional code to deal with anything that is created by another controller and given the stack label (possibly examining the ownerReferences).

  • Should sync.garbageCollect mark the object for deletion and only prune it next run? This could help with any race conditions in the K8s API server (I didn't run into any, but you never know).

It might make any race conditions less likely to bite, but it wouldn't remove them. In any case, I think once kubectl apply has succeeded, the resource can be considered durably updated.

  • Logger is broken in sync.Sync so fmt.Printf had to be used. We should figure this out.

I've replaced the printfs with logger.Log. What may have been happening is that Log will return an error if it's invoked with an odd number of arguments (since it expects label, value pairs); and we ignore the return value, so it fails silently.
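A minimal sketch of the "check the return value" point, using a go-kit logfmt logger (flux's logging is go-kit style; the key/value pairs and the fallback here are illustrative):

```go
package main

import (
	"fmt"
	"os"

	"github.com/go-kit/kit/log"
)

func main() {
	logger := log.NewLogfmtLogger(os.Stdout)

	// Log returns an error; discarding it is what makes failures silent.
	// Checking it (or wrapping the logger so every call is checked) at least
	// makes them visible.
	if err := logger.Log("resource", "default:deployment/helloworld", "action", "delete"); err != nil {
		fmt.Fprintf(os.Stderr, "logging failed: %v\n", err)
	}
}
```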

  • Tests, tests, tests.

Yup. I need to bring the existing tests up to date; then we'll have to put in new ones that cover the various scenarios of moving, removing, and adding resources.

  • flux.weave.works/immortal

I think flux.weave.works/ignore would suffice for now; but, it may be a requirement for some people that a resource is updated (not ignored) while it is represented in the git repo, but not deleted if it's removed.

@squaremo (Member)

While working on those commits, I did start wondering why stacks exist. I accept the argument for limiting the "blast radius" of changes (so e.g., every time you change a label somewhere, it doesn't result in every single resource being updated). But why not limit the blast radius to a single resource?

@Timer (Contributor, Author) commented Oct 24, 2018

Thanks for the extensive updates! Sorry I don't have more time to work on this right now. 😅

A per-resource hash would probably work well. This was originally based on prior art that had the stack concept, more akin to Helm charts. Here, the user is defining their resources directly rather than having them generated arbitrarily, so a per-resource hash should fit nicely.

@squaremo (Member) commented Nov 8, 2018

I've rebased this and made sure the tests run and pass. The latter meant losing some coverage -- specifically, testing whether deleting a resource results in the cluster resource being deleted. I'll have to build up the tests in cluster/kubernetes to cover that.

@Timer (Contributor, Author) commented Nov 17, 2018

Anything I can help with to push this along? :-)

I think I have some free time coming up.

@Timer (Contributor, Author) commented Nov 17, 2018

Upgraded to this new version in my cluster and it's working great!

@squaremo (Member)

Anything I can help with to push this along? :-)

I've added some test scaffolding in sync_test.go. This exercises pretty much the bare minimum of the sync (and deletion). What would really be useful is to expand the cases covered, especially the corner cases. For example, I have no tests checking what happens with ignored resources (in fact I don't think they are accounted for in the code, so we'll need to come up with the semantics as well).
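By way of example, here is the shape such a case could take, written against a toy fake cluster rather than the real scaffolding in sync_test.go (everything below is illustrative):

```go
package sync_test

import "testing"

// fakeCluster is a toy stand-in for the mock cluster used by the real
// scaffolding: it remembers what was applied and reports what it would prune.
type fakeCluster struct {
	applied map[string]bool
}

// sync applies everything in repo and garbage-collects anything previously
// applied that is no longer present.
func (c *fakeCluster) sync(repo map[string]string) (deleted []string) {
	next := map[string]bool{}
	for id := range repo {
		next[id] = true
	}
	for id := range c.applied {
		if !next[id] {
			deleted = append(deleted, id)
		}
	}
	c.applied = next
	return deleted
}

// TestRemovedManifestIsDeleted: sync once, drop a manifest from the "repo",
// sync again, and check that the corresponding resource is garbage-collected.
func TestRemovedManifestIsDeleted(t *testing.T) {
	c := &fakeCluster{applied: map[string]bool{}}
	repo := map[string]string{
		"default:deployment/helloworld": "manifest",
		"default:service/helloworld":    "manifest",
	}
	c.sync(repo)

	delete(repo, "default:service/helloworld")
	deleted := c.sync(repo)

	if len(deleted) != 1 || deleted[0] != "default:service/helloworld" {
		t.Fatalf("expected the removed service to be pruned, got %v", deleted)
	}
}
```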

@squaremo (Member)

Idea: use a sanitised git repo URL as the stack name. That means that when flux supports multiple repos, it will be able to do syncs piece-wise, a repo at a time, and not delete something that simply came from another repo.

Technically, we can partition and name the stacks however we like. But it might be nice to have a label saying which repo a resource came from, for other purposes. And it's useful for the stack to be less than or equal to the unit of syncing; otherwise there might be glitches where, say, a resource gets deleted because it has changed stack but that change hasn't been applied yet. (Syncing per repo will still have this issue if a resource moves from one repo to another -- but since repos are updated independently, syncing all repos at once will too.)
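A sketch of what sanitising a repo URL into a stack name could look like, assuming Kubernetes label-value rules (alphanumerics plus '-', '_', '.', at most 63 characters); the function name and the exact scheme are made up for illustration:

```go
package main

import (
	"fmt"
	"strings"
)

// stackNameFromRepoURL reduces a git URL to something usable as a Kubernetes
// label value: only [A-Za-z0-9._-], trimmed at the ends, max 63 characters.
func stackNameFromRepoURL(url string) string {
	mapped := strings.Map(func(r rune) rune {
		switch {
		case r >= 'a' && r <= 'z', r >= 'A' && r <= 'Z', r >= '0' && r <= '9':
			return r
		case r == '.', r == '-', r == '_':
			return r
		default:
			return '-' // replace ':', '/', '@', etc.
		}
	}, url)
	mapped = strings.Trim(mapped, "-._")
	if len(mapped) > 63 {
		mapped = mapped[len(mapped)-63:] // keep the most specific suffix
		mapped = strings.Trim(mapped, "-._")
	}
	return mapped
}

func main() {
	fmt.Println(stackNameFromRepoURL("git@github.com:fluxcd/flux-get-started.git"))
	// Output: git-github.com-fluxcd-flux-get-started.git
}
```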

@squaremo (Member)

@Timer I have a rebased branch that squashes the commits you marked as fix: ... into their predecessors. Mind if I force-push it?

@Timer (Contributor, Author) commented Nov 27, 2018

Feel free to force push anything you'd like. :-)

@squaremo force-pushed the feature/stack-tracking branch 2 times, most recently from 1d629b0 to 9aa11a1 on December 20, 2018 at 14:06
@squaremo (Member) commented Dec 20, 2018

I think this is ready for release as an experimental feature (i.e., can be reviewed and merged).

To address the TODOs:

  • Put behind a formal flag, e.g. --experimental-deletion
  • cluster.Cluster's Sync method should probably receive a more appropriate object than 2x map[string]policy.Update
  • cluster.applyMetadata should be refactored and simplified
  • Should sync.garbageCollect mark the object for deletion and only prune it next run? This could help with any race conditions in the K8s API server (I didn't run into any, but you never know).
  • Logger is broken in sync.Sync so fmt.Printf had to be used. We should figure this out.
  • Tests, tests, tests.
  • flux.weave.works/immortal

The annotation "flux.weave.works/ignore" will effectively make something immortal -- flux won't apply it, and (if on a resource in the cluster) won't delete it, even if it's removed from the repo.
These are explained in comments above (or commit comments).

  • cluster.ExportByLabel should probably be given a few failure/retry cases to handle; what about when we're limited by RBAC?

Accounting for the possibilities is a bit mind-boggling. I think it might be easier to release this as an experimental feature, and try it in real environments (by which I mean disposable environments).

  • Skipping resources (v1:ComponentStatus and v1:Endpoints) shouldn't happen at the listing level, but probably at the delete/diff level

I'm not convinced we have to skip any resources at all, for correctness. We can generalise or embellish this later if necessary, though.

@squaremo changed the title from "[WIP] Delete resources no longer in git" to "Delete resources no longer in git" on Dec 20, 2018
@squaremo (Member) commented Jan 2, 2019

@rndstr Do you mind having a look at this? I won't hold you to a definitive approval, unless you want to give one :-)

@rndstr (Contributor) commented Jan 3, 2019

@squaremo I likely won't be able to give it a proper look before next Tuesday

Review comment from a Contributor on sync/sync.go (outdated), on this hunk:

    logger.Log("resource", res.ResourceID(), "ignore", "apply")
    return
    continue

nit: noop

Other review threads: sync/sync.go (outdated, resolved); cluster/kubernetes/sync.go (resolved)
@rndstr (Contributor) left a review:

played around with it some more locally and it works great.

@squaremo (Member)

Brilliant, thanks for the review Roli! I appreciate you taking the time to check this out :-)

I had another play myself and I have noticed something: resources that aren't namespaced will get deleted even if they are among the files. The cause is that when loaded from files, resources get given the "default" namespace if they don't specify one. But when loaded from the cluster, they will either have a namespace (because they've been created in a namespace implicitly) or have an empty "" namespace. Since we identify resources using the namespace (and the kind and the name), when the garbage collection goes to look at which resources are in the cluster but not in the files, it will think they are different, and delete them.

Assigning the namespace "default" to resources missing a namespace is wrong, since they can end up being in another namespace; but so is leaving that field empty, or giving them a sentinel value (e.g., <default>), for the same reason. Finding out the default namespace in the kubectl config and using that will have a similar problem for un-namespaced resources -- they will come back from the cluster with an empty namespace.

I don't think there's anything for it other than to figure out which resources are supposed to have a namespace, and filling in the default value. Ideally we'd be able to do that in a place we're already querying for resources.
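One place that information is available is the Kubernetes discovery API; a sketch using client-go (an illustration of the approach, not necessarily how the eventual fix was implemented):

```go
package main

import (
	"fmt"

	"k8s.io/client-go/discovery"
	"k8s.io/client-go/tools/clientcmd"
)

// namespacedKinds asks the API server which resource kinds are namespaced, so
// that manifests without a namespace can be given the default one only when
// they actually need it; cluster-scoped kinds are left alone.
func namespacedKinds(dc discovery.DiscoveryInterface) (map[string]bool, error) {
	kinds := map[string]bool{}
	_, lists, err := dc.ServerGroupsAndResources()
	if err != nil {
		return nil, err
	}
	for _, list := range lists {
		for _, r := range list.APIResources {
			kinds[list.GroupVersion+"/"+r.Kind] = r.Namespaced
		}
	}
	return kinds, nil
}

func main() {
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	dc, err := discovery.NewDiscoveryClientForConfig(config)
	if err != nil {
		panic(err)
	}
	kinds, err := namespacedKinds(dc)
	if err != nil {
		panic(err)
	}
	fmt.Println("v1/Namespace namespaced?", kinds["v1/Namespace"]) // false
}
```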

@2opremio force-pushed the feature/stack-tracking branch 2 times, most recently from f11b18c to 1d86d57 on February 26, 2019 at 14:34
@squaremo mentioned this pull request on Feb 26, 2019
@2opremio force-pushed the feature/stack-tracking branch 2 times, most recently from 08b4ff5 to a6f0142 on February 26, 2019 at 16:41
@2opremio (Contributor)

I think all the comments have been addressed. The only missing thing is documentation.

Review thread (outdated, resolved): site/faq.md
@squaremo merged commit d76ecab into fluxcd:master on Feb 27, 2019
@theduke commented Mar 6, 2019

Just wanted to give a shoutout and thanks to @Timer and @squaremo , implementation of this feature is much appreciated!

squaremo added a commit that referenced this pull request Mar 21, 2019
The PR #1442 introduced code to determine which namespace, if any, each
manifest belongs to. To distinguish between resources that need a
namespace but don't have one, and resources that are cluster-scoped,
it introduced the sentinel value `<cluster>` for the latter.

Regrettably, I didn't accompany this with code for _parsing_ those
sentinel values, since I reasoned that it would only be used
internally. But the sync events generated by fluxd include a list of
changed resources, and those inevitably will include things like
namespaces that are cluster-scoped. The result is that fluxd will
generate events that cannot then be parsed by the receiver.

This commit fixes that by recognising `<cluster>` as a namespace when
parsing resource IDs.
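A sketch of the parsing side of that fix, assuming flux's namespace:kind/name ID format (the real parser lives in flux's resource ID package; this is only an illustration):

```go
package main

import (
	"fmt"
	"strings"
)

const clusterScope = "<cluster>"

// parseResourceID splits an ID of the form "<namespace>:<kind>/<name>",
// treating the sentinel "<cluster>" as "no namespace" for cluster-scoped
// resources, as the commit above describes.
func parseResourceID(id string) (namespace, kind, name string, err error) {
	colon := strings.Index(id, ":")
	if colon < 0 {
		return "", "", "", fmt.Errorf("malformed resource ID: %q", id)
	}
	namespace, rest := id[:colon], id[colon+1:]
	slash := strings.Index(rest, "/")
	if slash < 0 {
		return "", "", "", fmt.Errorf("malformed resource ID: %q", id)
	}
	kind, name = rest[:slash], rest[slash+1:]
	if namespace == clusterScope {
		namespace = "" // cluster-scoped: no namespace
	}
	return namespace, kind, name, nil
}

func main() {
	fmt.Println(parseResourceID("<cluster>:namespace/kube-system"))
	fmt.Println(parseResourceID("default:deployment/helloworld"))
}
```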
2opremio added two commits to 2opremio/flux that referenced this pull request Apr 4, 2019, and hiddeco pushed a commit that referenced this pull request Apr 4, 2019:

This fixes a regression introduced by #1442 in which applying the `tag_all`
pseudo policy failed for workload manifests in which the namespace is omitted.

The effective namespace of the workload wasn't set after parsing, causing a
mismatch in the resource identifier (the parsed resource identifier was
cluster-scoped due to the lack of explicit namespace in the manifest).