Add daily check for vulnerability issues using Trivy #600

caponetto · 2024-06-28T14:50:34Z

https://issues.redhat.com/browse/RHOAIENG-8779

Description

This PR introduces the usage of Trivy to scan the container images and consolidate vulnerability reports. I've configured the Build Notebooks workflow to be triggered daily at 2 am UTC to generate the report (the scan runs after each image is built). I didn't see the need to run it on each push but we can enable it if it makes sense. Each report is uploaded to the workflow summary as soon as each individual job is completed. Finally, reports follow a markdown template.

Demo

Screen.Recording.2024-06-28.at.11.27.45.mp4

Here is an example of workflow run where you can see real reports.

Note: I haven't added the checks for PRs because the scan operation adds an extra ~10 minutes, which would slow our PR jobs down.

How Has This Been Tested?

I've executed the workflow on my fork.

Merge criteria:

The commits are squashed in a cohesive manner and have meaningful messages.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work

.github/workflows/build-notebooks-TEMPLATE.yaml

ci/trivy-markdown.tpl

jiridanek

Just a few small suggestions. Over all, lgtm.

We can't fail the build when vulnerabilities are detected, that way we'd never have a pass. But these inforeports are not very practical to work with either, there is too much info to scan. We'd need to either reduce the number of findings the tool produces, so it is possible to read through it easily.

Currently, this is mainly useful to check that some vulnerability we intended to fix was indeed fixed. But, then I'd have to first merge the PR to get scan on it. (Or run scan on my machine.)

It does not seem very actionable to me right away, but it is a great capability to have around. And, I did not know the github feature https://docs.github.com/en/actions/using-workflows/workflow-commands-for-github-actions#adding-a-job-summary, so that's interesting for me too.

caponetto · 2024-06-28T16:12:25Z

@jiridanek thanks for the review!

We can't fail the build when vulnerabilities are detected, that way we'd never have a pass. But these inforeports are not very practical to work with either, there is too much info to scan. We'd need to either reduce the number of findings the tool produces, so it is possible to read through it easily.

Yeah, I agree. I've reduced the scope to only HIGH and CRITICAL issues because the list was getting bigger if we allowed all of them. But let's see if it helps otherwise we can revisit it and try to personalize more.

Currently, this is mainly useful to check that some vulnerability we intended to fix was indeed fixed. But, then I'd have to first merge the PR to get scan on it. (Or run scan on my machine.)

We can't enable it to run for all PRs due to the amount of time it adds but now I'm thinking if we can come up with some strategy like running it if the PR has a certain label or something like that.

.github/workflows/build-notebooks-TEMPLATE.yaml

caponetto · 2024-07-01T11:41:22Z

Ok, this PR is ready.

atheo89

Great CVE report tool! I have a question: Do we push the images to ghcr after completing the report? If so, I would suggest not doing it because we will fill up the registry on a daily basis.

jiridanek · 2024-07-02T08:53:18Z

Do we push the images to ghcr after completing the report? If so, I would suggest not doing it because we will fill up the registry on a daily basis.

we have to push, the makefile is written in such a way that it builds and pushes, without the possibility to skip a push

notebooks/Makefile

Lines 61 to 65 in 3f93529

    
           define image 
        
           	$(info #*# Image build directory: <$(2)> #(MACHINE-PARSED LINE)#*#...) 
        
           	$(call build_image,$(1),$(2),$(3)) 
        
           	$(call push_image,$(1)) 
        
           endef

regarding filling up space, first, there does not seem to be a limit that applies to us, and second, please review this PR that sets up a cleaning job to free up the space daily

ci: implement ghcr.io expiration for images and cache layers #601

caponetto · 2024-07-02T09:31:50Z

@atheo89 Thanks for raising this concern! Another option is to use the approach we have for pull_request, which publishes the image to localhost (ref) but I think it won't be an issue anymore after @jiridanek's PR that cleans up the registry periodically. Plus, it could be helpful to have them published somewhere if someone needs to debug the associated image with the report.

atheo89 · 2024-07-02T09:44:53Z

Great! Let's make sure enough time passes before Jiri's prune workflow starts! 🙂

https://github.com/opendatahub-io/notebooks/pull/601/files#diff-6a2b18c5159aa29ecd02344b37be379eac595c441858bd84f754a0353e0bb5caR12

/lgtm
/approve

openshift-ci · 2024-07-02T09:45:00Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: atheo89, jiridanek

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [atheo89]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

atheo89 · 2024-07-02T09:55:28Z

we have to push, the makefile is written in such a way that it builds and pushes, without the possibility to skip a push

That's true; but it's not a big deal to make recipes more independent if that makes our lives easier. Added a tracker here

openshift-ci bot requested review from atheo89 and dibryant June 28, 2024 14:50

jiridanek reviewed Jun 28, 2024

View reviewed changes

.github/workflows/build-notebooks-TEMPLATE.yaml Show resolved Hide resolved

jiridanek reviewed Jun 28, 2024

View reviewed changes

ci/trivy-markdown.tpl Show resolved Hide resolved

jiridanek approved these changes Jun 28, 2024

View reviewed changes

openshift-ci bot assigned jiridanek Jun 28, 2024

openshift-ci bot added the lgtm label Jun 28, 2024

jiridanek reviewed Jun 28, 2024

View reviewed changes

.github/workflows/build-notebooks-TEMPLATE.yaml Outdated Show resolved Hide resolved

caponetto force-pushed the RHOAIENG-8779 branch from 23bf487 to 1e5f861 Compare June 28, 2024 18:16

openshift-ci bot removed the lgtm label Jun 28, 2024

caponetto added the do-not-merge/hold label Jun 28, 2024

Add daily check for vulnerability issues using Trivy

bc66678

caponetto force-pushed the RHOAIENG-8779 branch from 1e5f861 to bc66678 Compare July 1, 2024 11:35

caponetto removed the do-not-merge/hold label Jul 1, 2024

jiridanek approved these changes Jul 1, 2024

View reviewed changes

openshift-ci bot added the lgtm label Jul 1, 2024

atheo89 reviewed Jul 2, 2024

View reviewed changes

openshift-ci bot assigned atheo89 Jul 2, 2024

openshift-ci bot added the approved label Jul 2, 2024

openshift-merge-bot bot merged commit d7b7438 into opendatahub-io:main Jul 2, 2024
6 checks passed

jiridanek mentioned this pull request Jul 3, 2024

ci: start podman.socket and pass it to trivy to avoid unnecessary pulls #605

Merged

3 tasks

jiridanek mentioned this pull request Aug 1, 2024

RHOAIENG-9822: chore(Makefile): allow not pushing built images in Makefile and allow skipping building dependent images #657

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add daily check for vulnerability issues using Trivy #600

Add daily check for vulnerability issues using Trivy #600

caponetto commented Jun 28, 2024

jiridanek left a comment

caponetto commented Jun 28, 2024

caponetto commented Jul 1, 2024

atheo89 left a comment

jiridanek commented Jul 2, 2024

caponetto commented Jul 2, 2024

atheo89 commented Jul 2, 2024

openshift-ci bot commented Jul 2, 2024

atheo89 commented Jul 2, 2024

Add daily check for vulnerability issues using Trivy #600

Add daily check for vulnerability issues using Trivy #600

Conversation

caponetto commented Jun 28, 2024

Description

How Has This Been Tested?

Merge criteria:

jiridanek left a comment

Choose a reason for hiding this comment

caponetto commented Jun 28, 2024

caponetto commented Jul 1, 2024

atheo89 left a comment

Choose a reason for hiding this comment

jiridanek commented Jul 2, 2024

caponetto commented Jul 2, 2024

atheo89 commented Jul 2, 2024

openshift-ci bot commented Jul 2, 2024

atheo89 commented Jul 2, 2024