Invoke Argo Rollouts AnalysisRuns as part of Freight verification #1257

jessesuen · 2023-12-08T22:48:59Z

Proposed Feature

Today, verifying a Freight is equivalent to waiting for the Argo CD application (if specified) to reach a healthy state.

We want to extend this to do other things, including:

running user-defined tests/scripts (e.g. kicking off a Kubernetes Job that must return success)
querying metrics (prometheus, datadog, custom API endpoints, etc...) and ensuring KPIs are met

The way we should support the above is to re-use Argo Rollout's Analysis feature which does pretty much exactly what we want.

Motivation

Today, CI is commonly handling the testing aspect of post-promotion. If Kargo is to replace all facets of promotion in a deployment pipeline, it must be able to run user-defined tests.

Suggested Implementation

The approach will be very similar to our Argo CD integration, which does the following:

After freight is promoted, we sync the ArgoCD app
Then we wait until the app is healthy before marking the freight as verified in the stage

The Argo Rollouts Analysis would be done similarly:

After freight is promoted, and after the Argo CD app is healthy, we kick off an AnalysisRun
Then we wait until AnalysisRun is successful before marking the freight as verified in the stage

Kargo will need to add an analysis stanza to the stage spec, where it references one or more AnalysisTemplates:

      analysis:
        templates:
        - templateName: success-rate
        args:
        - name: service-name
          value: guestbook-svc.default.svc.cluster.local

When the Stage reconcile is trying to verify Freight after promotion, it would create an AnalysisRun from the referenced AnalysisTemplate(s). This is how it is done in Rollouts (we would have similar logic):
https://github.com/argoproj/argo-rollouts/blob/54d83d676b10d8246766bdbd5d407c0d39e3d9d0/utils/analysis/helpers.go#L291-L322

One difference from the above logic, is that we should avoid import dependency on Argo Rollouts. So the above logic will need to be translated to deal with unstructured objects, so we preserve all fields of an AnalysisTemplate when converting it to an AnalysisRun.

We would also need an informer on analysisruns that waits until AnalysisRun status.phase is completed (just like we have an informer on applications that waits until applications are healthy.

geowalrus4gh · 2023-12-11T18:03:27Z

We usually do it in the Rollout spec. Any reason not to do this in here and move it to kargo level ?

jessesuen · 2023-12-11T20:02:00Z

I can think of several:

Rollouts operate on a single workload and during the update. You may need analysis to be performed at a conceptually higher level (e.g. after multiple microservices are deployed and reach healthy, or after the whole Argo CD application becomes healthy)
If you aren't using Argo Rollouts
If promotion is not even Kubernetes based, but is promoting config for something else (e.g. terraform, uploading assets to CDN).

geowalrus4gh · 2023-12-13T17:57:37Z

The first point is super valid! at least in our case. Fingers crossed!

krancour · 2023-12-23T00:45:09Z

This was closed by #1259

jessesuen added the kind/feature-request label Dec 8, 2023

github-actions bot added needs/priority needs/area labels Dec 8, 2023

jessesuen added priority/high area/controller labels Dec 8, 2023

github-actions bot removed needs/priority needs/area labels Dec 8, 2023

jessesuen assigned krancour Dec 8, 2023

krancour added this to the v0.3.0 milestone Dec 23, 2023

krancour closed this as completed Dec 23, 2023

krancour added kind/enhancement and removed kind/feature-request labels Jun 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Invoke Argo Rollouts AnalysisRuns as part of Freight verification #1257

Invoke Argo Rollouts AnalysisRuns as part of Freight verification #1257

jessesuen commented Dec 8, 2023 •

edited

Loading

geowalrus4gh commented Dec 11, 2023 •

edited

Loading

jessesuen commented Dec 11, 2023

geowalrus4gh commented Dec 13, 2023

krancour commented Dec 23, 2023

Invoke Argo Rollouts AnalysisRuns as part of Freight verification #1257

Invoke Argo Rollouts AnalysisRuns as part of Freight verification #1257

Comments

jessesuen commented Dec 8, 2023 • edited Loading

Proposed Feature

Motivation

Suggested Implementation

geowalrus4gh commented Dec 11, 2023 • edited Loading

jessesuen commented Dec 11, 2023

geowalrus4gh commented Dec 13, 2023

krancour commented Dec 23, 2023

jessesuen commented Dec 8, 2023 •

edited

Loading

geowalrus4gh commented Dec 11, 2023 •

edited

Loading