🐛 Pinned-Dependencies continues on error #3515

pnacht · 2023-09-25T23:05:44Z

What kind of change does this PR introduce?

PR title follows the guidelines defined in our pull request documentation

What is the current behavior?

Whenever Pinned-Dependencies hits a runtime error (i.e. can't detect job OS, can't parse a Dockerfile, etc), the check crashes entirely and the project gets an inconclusive score for the check.

What is the new behavior (if this is a feature change)?**

If Pinned-Dependencies hits a runtime error, the "element" that caused the error (i.e. workflow job, Dockerfile) is skipped and the check progresses as well as possible.

TO-DO

At this stage of the PR, the proposed change only applies to failures to detect a job's operating system (as happens with apache/beam). The idea is for this to cover other similar cases (i.e. apache/arrow, caused by an error parsing a Dockerfile).

Also, at this stage, the failure to detect a job's operating system isn't logged anywhere. The job is simply skipped "invisibly". This is useful information that should be displayed as a warning in the check's details. However, there's currently no place to store this information since PinningDependenciesData only stores data on problematic dependencies, not on problems encountered while investigating.

The most straightforward solution I see is to modify PinningDependenciesData to also contain new SkippedWorkflowJobs and SkippedFiles fields (each a struct containing the job/filename and skip reason). The data in these fields can later be added to the logs as warnings, i.e:

Warn: myWorkflow.yaml's job 'myJob' skipped: could not determine operating system

The job/filenames can be either parsed from the error message or a new error type can be defined which also includes this metadata. A type assertion can then be used to identify such cases and extract the data from the error. (I don't know where I'd store this new error type)

However, I'm no expert in either Go or the Scorecard codebase, so would like to get feedback on this solution before implementing it.

Tests for the changes have been added (for bug fixes/features)

I have added the same test workflow to multiple test functions in raw/pinned-dependencies-test.go. The results show that a problematic job affects non-GHA-pinning scores, but GHA-pinning is unaffected.

Notes for the reviewer

I couldn't see where to neatly add anything regarding this change to the Pinned-Deps documentation (which is somewhat high-level, without much detail on how the score is calculated). I have therefore left the docs as-is, without changes. Let me know if you wish to add a comment on such "error-handling" to the check's docs.

Which issue(s) this PR fixes

Fixes #3316.

Does this PR introduce a user-facing change?

Pinned-Dependencies now continues after encountering runtime errors

codecov · 2023-09-25T23:08:33Z

Codecov Report

Merging #3515 (5705652) into main (e16d3e3) will decrease coverage by 5.62%.
The diff coverage is 93.22%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3515      +/-   ##
==========================================
- Coverage   76.01%   70.40%   -5.62%     
==========================================
  Files         206      206              
  Lines       14003    14053      +50     
==========================================
- Hits        10645     9894     -751     
- Misses       2727     3585     +858     
+ Partials      631      574      -57

spencerschrock · 2023-09-27T16:32:30Z

If Pinned-Dependencies hits a runtime error, the "element" that caused the error (i.e. workflow job, Dockerfile) is skipped and the check progresses as well as possible.

I think this is fine. There's the chance of missing dangerous workflows this way, but if we log it (either in the execution, or as a detail) then hopefully we still get bug reports.

The most straightforward solution I see is to modify PinningDependenciesData

I think the simplest is to just throw in a log that mentions it, but I can see the value in having it in the details. We currently handle some parse errors just as a silent debug statement:

scorecard/checks/evaluation/pinned_dependencies.go

Lines 80 to 89 in 7034306

    
           if rr.Msg != nil { 
        
           	dl.Debug(&checker.LogMessage{ 
        
           		Path:      rr.Location.Path, 
        
           		Type:      rr.Location.Type, 
        
           		Offset:    rr.Location.Offset, 
        
           		EndOffset: rr.Location.EndOffset, 
        
           		Text:      *rr.Msg, 
        
           		Snippet:   rr.Location.Snippet, 
        
           	}) 
        
           	continue

A type assertion can then be used to identify such cases and extract the data from the error

These days, it's usually done with errors.As

Of course the other option is to fix the root cause of #3316, instead of skipping over it. But that may be a more complicated fix. Any thoughts @laurentsimon @raghavkaul ?

github-actions · 2023-10-08T01:46:37Z

Stale pull request message

pnacht · 2023-10-20T23:27:17Z

I think the simplest is to just throw in a log that mentions it, but I can see the value in having it in the details. We currently handle some parse errors just as a silent debug statement:

scorecard/checks/evaluation/pinned_dependencies.go

Lines 80 to 89 in 7034306

if rr.Msg != nil {

dl.Debug(&checker.LogMessage{

Path: rr.Location.Path,

Type: rr.Location.Type,

Offset: rr.Location.Offset,

EndOffset: rr.Location.EndOffset,

Text: *rr.Msg,

Snippet: rr.Location.Snippet,

})

continue

I've just pushed some commits such that the skipped steps are displayed in the details.

Currently, scorecard --repo apache/beam --checks Pinned-Dependencies [...] | jq '.checks[0]' returns:

{
  "details": null,
  "score": -1,
  "reason": "internal error: internal error: unable to determine OS for job: Build python wheels on ${{matrix.arch}} for ${{ matrix.os_python.os }}",
}

With the proposed change, the score is now correctly calculated (skipping the problematic steps), and everything that had to be skipped is logged in the details.

{
  "details": [
    "Warn: Possibly incomplete results: error parsing job operating system: .github/workflows/build_wheels.yml's job 'Build python wheels on ${{matrix.arch}} for ${{ matrix.os_python.os }}' (step 5)",
    "Warn: Possibly incomplete results: error parsing job operating system: .github/workflows/build_wheels.yml's job 'Build python wheels on ${{matrix.arch}} for ${{ matrix.os_python.os }}' (step 7)",
    "Warn: GitHub-owned GitHubAction not pinned by hash: .github/workflows/assign_milestone.yml:34: update your workflow using https://app.stepsecurity.io/secureworkflow/apache/beam/assign_milestone.yml/master?enable=pin",
    "... 1000+ other warnings ..."
  ],
  "score": 0,
  "reason": "dependency not pinned by hash detected -- score normalized to 0",
}

It was also quite straightforward to do the same for the apache/arrow error, so I've implemented that as well.

scorecard --repo apache/arrow --checks Pinned-Dependencies [...] | jq '.checks[0]' before:

Error: check runtime error: Pinned-Dependencies: internal error: error parsing shell code: ci/docker/python-wheel-windows-test-vs2017.dockerfile:1:2: "if <cond>" must be followed by "then"
2023/10/20 23:06:16 error during command execution: check runtime error: Pinned-Dependencies: internal error: error parsing shell code: ci/docker/python-wheel-windows-test-vs2017.dockerfile:1:2: "if <cond>" must be followed by "then"

{
  "details": null,
  "score": -1,
  "reason": "internal error: error parsing shell code: ci/docker/python-wheel-windows-test-vs2017.dockerfile:1:2: \"if <cond>\" must be followed by \"then\"",
}

Now:

{
  "details": [
    "Warn: Possibly incomplete results: error parsing shell code: ci/docker/python-wheel-windows-test-vs2017.dockerfile:1:2: \"if \u003ccond\u003e\" must be followed by \"then\"",
    "Warn: Possibly incomplete results: error parsing shell code: ci/docker/python-wheel-windows-vs2017.dockerfile:1:2: \"if \u003ccond\u003e\" must be followed by \"then\"",
    "Warn: GitHub-owned GitHubAction not pinned by hash: .github/workflows/archery.yml:53: update your workflow using https://app.stepsecurity.io/secureworkflow/apache/arrow/archery.yml/main?enable=pin",
    "... 300+ warnings ..."
  ],
  "score": 0,
  "reason": "dependency not pinned by hash detected -- score normalized to 0",
}

I've seen a few other places where this maybe could also be applied, but I haven't dug into them to be sure yet.

These days, it's usually done with errors.As

Thanks for the tip, I used it! (I'm still getting my golang sea legs...)

Of course the other option is to fix the root cause of #3316, instead of skipping over it. But that may be a more complicated fix. Any thoughts @laurentsimon @raghavkaul ?

The issue here isn't necessarily in Scorecard, though:

for apache/beam, the problem is a failure to identify a job's OS if the matrix is too complex (matrix variables that are dictionaries). That could probably be figured out within Scorecard.
however, for apache/arrow, the error is thrown directly by mvdan.cc/sh/v3/syntax.

spencerschrock · 2023-10-24T17:42:08Z

With the proposed change, the score is now correctly calculated (skipping the problematic steps), and everything that had to be skipped is logged in the details.

I'd lean towards Info instead of warn? just so it doesn't end up as something in the security dashboard?

for apache/beam, the problem is a failure to identify a job's OS if the matrix is too complex (matrix variables that are dictionaries). That could probably be figured out within Scorecard.

I have a change I was playing around with, but haven't had time to finish testing.

however, for apache/arrow, the error is thrown directly by mvdan.cc/sh/v3/syntax.

Because we don't have a shell parser for powershell. So partially scorecard's fault

checker/raw_result.go

laurentsimon · 2023-10-26T16:07:43Z

If Pinned-Dependencies hits a runtime error, the "element" that caused the error (i.e. workflow job, Dockerfile) is skipped and the check progresses as well as possible.

I think this is fine. There's the chance of missing dangerous workflows this way, but if we log it (either in the execution, or as a detail) then hopefully we still get bug reports.

The most straightforward solution I see is to modify PinningDependenciesData

I think the simplest is to just throw in a log that mentions it, but I can see the value in having it in the details. We currently handle some parse errors just as a silent debug statement:

scorecard/checks/evaluation/pinned_dependencies.go

Lines 80 to 89 in 7034306

if rr.Msg != nil {

dl.Debug(&checker.LogMessage{

Path: rr.Location.Path,

Type: rr.Location.Type,

Offset: rr.Location.Offset,

EndOffset: rr.Location.EndOffset,

Text: *rr.Msg,

Snippet: rr.Location.Snippet,

})

continue

A type assertion can then be used to identify such cases and extract the data from the error

These days, it's usually done with errors.As

Of course the other option is to fix the root cause of #3316, instead of skipping over it. But that may be a more complicated fix. Any thoughts @laurentsimon @raghavkaul ?

I'm fine with logging only for now, until we have better parsing code

checker/raw_result.go

checks/evaluation/pinned_dependencies_test.go

checks/raw/pinned_dependencies_test.go

errors/public.go