Feature: more robust way of logging errors in sub-checks #1327

laurentsimon · 2021-11-22T17:05:15Z

Today, we return an inconclusive result (accompanied by an error) if a sub-check catches an error. For example, the Pinned-Dependencies check is currently fairly (too!) large, so a single error makes the entire check return -1.

In #1324 (comment), we implemented a heuristic to avoid running the sub-check, using the filename. I feel this approach is fragile. Dockerfile templates are common, and are not always called template, tpl, etc. See #710 and https://raw.githubusercontent.com/kubernetes/kubernetes/master/build/server-image/Dockerfile.

It'd be useful to have a more robust solution. I'm not sure what a better solution is.
Maybe we should create an Error() function to log errors in sub-checks without affecting the entire check result.

Any other ideas?


laurentsimon · 2021-11-22T17:05:27Z

@chrismcgehee @azeemsgoogle @naveensrinivasan

laurentsimon · 2021-11-22T23:54:03Z

relevant for shell parsing #1307

ristomcgehee · 2021-11-23T04:15:10Z

I do think we should simply log when we can't parse a file instead of failing the entire check, essentially do what #1312 does but for any parsing error. It would be nice if we could examine the files that can't be parsed in case we can improve our code to parse better, but inevitably we're going to encounter files that are formatted incorrectly. Maybe we can keep a list of files that we know are not parseable, and just ignore them.
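A minimal Go sketch of the log-and-continue idea described above, not scorecard's actual code. The `file` and `result` types, the `parse` stand-in (which only rejects `{{` delimiters), and the `knownUnparseable` entry are all hypothetical names made up for illustration.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// file is a hypothetical (name, content) pair standing in for a repo file.
type file struct {
	Name    string
	Content string
}

// result separates successfully parsed files from logged parse failures.
type result struct {
	Parsed []string // names of files that parsed
	Logs   []string // one message per file that failed to parse
}

// errTemplate marks content the stand-in parser cannot handle.
var errTemplate = errors.New("unexpected template delimiter")

// knownUnparseable is an illustrative ignore list; the entry is made up.
var knownUnparseable = map[string]bool{"broken/Dockerfile": true}

// parse is a stand-in for a real Dockerfile parser: it only rejects
// content that contains a "{{" template delimiter.
func parse(f file) error {
	if strings.Contains(f.Content, "{{") {
		return fmt.Errorf("%s: %w", f.Name, errTemplate)
	}
	return nil
}

// checkFiles parses every file, logging failures instead of aborting.
func checkFiles(files []file) result {
	var r result
	for _, f := range files {
		if knownUnparseable[f.Name] {
			continue // known bad file: skip silently
		}
		if err := parse(f); err != nil {
			r.Logs = append(r.Logs, "cannot parse "+err.Error())
			continue // one bad file does not fail the whole check
		}
		r.Parsed = append(r.Parsed, f.Name)
	}
	return r
}

func main() {
	r := checkFiles([]file{
		{Name: "Dockerfile", Content: "FROM scratch"},
		{Name: "Dockerfile.tmpl", Content: "FROM {{ .Base }}"},
		{Name: "broken/Dockerfile", Content: "???"},
	})
	fmt.Println("parsed:", r.Parsed)
	fmt.Println("logs:", r.Logs)
}
```

Keeping the log messages in the result (rather than dropping them) is what would let maintainers later examine which files failed and decide whether the parser can be improved.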

In regards to skipping template Dockerfiles, I think using the filename is a decent start. The file https://raw.githubusercontent.com/kubernetes/kubernetes/master/build/server-image/Dockerfile is not, strictly speaking, a template; it is a fully valid Dockerfile, just with build arguments. To handle a file like this we should enhance our logic to look for hashes as you suggest in #710 (comment), but we shouldn't skip it because it's a template.
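One way "look for hashes" could be sketched in Go, under the assumption that a `FROM` image ending in `@sha256:<64 hex digits>` counts as pinned and that an image supplied through a build argument cannot be judged either way. The `pinStatus` type and `classifyFrom` function are hypothetical and are not taken from scorecard.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// pinStatus is a hypothetical three-way outcome for a FROM line.
type pinStatus int

const (
	unpinned     pinStatus = iota // image named without a digest
	pinned                        // image ends in @sha256:<digest>
	undetermined                  // image comes from a build argument
)

// digestRe matches an image reference that ends in a sha256 digest.
var digestRe = regexp.MustCompile(`@sha256:[a-f0-9]{64}$`)

// classifyFrom inspects one Dockerfile line. The boolean is false when
// the line is not a FROM instruction with an image argument.
func classifyFrom(line string) (pinStatus, bool) {
	fields := strings.Fields(line)
	if len(fields) < 2 || !strings.EqualFold(fields[0], "FROM") {
		return unpinned, false
	}
	image := ""
	for _, f := range fields[1:] {
		if strings.HasPrefix(f, "--") {
			continue // skip flags such as --platform=...
		}
		image = strings.Trim(f, `"'`)
		break
	}
	switch {
	case image == "":
		return unpinned, false
	case digestRe.MatchString(image):
		return pinned, true
	case strings.Contains(image, "$"):
		return undetermined, true // build argument: log it, do not skip the file
	default:
		return unpinned, true
	}
}

func main() {
	digest := strings.Repeat("a", 64) // placeholder digest, not a real image
	for _, l := range []string{
		"FROM golang:1.17@sha256:" + digest + " AS build",
		`FROM "${BASEIMAGE}"`,
		"FROM alpine:3.15",
	} {
		status, ok := classifyFrom(l)
		fmt.Println(status, ok, l)
	}
}
```

With a separate `undetermined` outcome, a Dockerfile that relies on build arguments is still scanned like any other file, and only the lines that cannot be resolved are reported as such.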

laurentsimon · 2021-12-02T17:32:32Z

> I do think we should simply log when we can't parse a file instead of failing the entire check, essentially do what #1312 does but for any parsing error. It would be nice if we could examine the files that can't be parsed in case we can improve our code to parse better, but inevitably we're going to encounter files that are formatted incorrectly. Maybe we can keep a list of files that we know are not parseable, and just ignore them.

that'd be great, yes!

> In regards to skipping template Dockerfiles, I think using the filename is a decent start. The file https://raw.githubusercontent.com/kubernetes/kubernetes/master/build/server-image/Dockerfile is not, strictly speaking, a template; it is a fully valid Dockerfile, just with build arguments. To handle a file like this we should enhance our logic to look for hashes as you suggest in #710 (comment), but we shouldn't skip it because it's a template.

Do you know if developers commit their generated dockerfiles from a template?
Can you list a few template dockerfiles that caused problems for scorecard?

In the next iteration of the check, would it still be useful to try to parse the templates and just log a message if parsing fails (like you suggest above)?

ristomcgehee · 2021-12-05T04:41:25Z

> Do you know if developers commit their generated dockerfiles from a template?

I think typically developers use a template dockerfile to build with different architectures or different distros as part of a pipeline. They might sometimes commit the dockerfiles generated from a template, but probably not usually.

> Can you list a few template dockerfiles that caused problems for scorecard?

This is the only one I'm aware of that caused errors for scorecard:
https://github.com/traefik/mesh/blob/957da2d9ca4d4cdd4e00d0bdfbb9f5ae4f2685ff/tmpl.Dockerfile
Here's another that would have caused errors if we were scanning them:
https://github.com/caddyserver/caddy-docker/blob/62cc558526168fc627b898a3c548c20ce995a8d2/Dockerfile.tmpl

> In the next iteration of the check, would it still be useful to try to parse the templates and just log a message if parsing fails (like you suggest above)?

I think that would be a better approach. It looks like some of the dockerfiles that we're now skipping (this one for example) are able to be parsed just fine.

laurentsimon · 2021-12-06T17:13:08Z

> > Do you know if developers commit their generated dockerfiles from a template?

> I think typically developers use a template dockerfile to build with different architectures or different distros as part of a pipeline. They might sometimes commit the dockerfiles generated from a template, but probably not usually.

> > Can you list a few template dockerfiles that caused problems for scorecard?

> This is the only one I'm aware of that caused errors for scorecard: https://github.com/traefik/mesh/blob/957da2d9ca4d4cdd4e00d0bdfbb9f5ae4f2685ff/tmpl.Dockerfile Here's another that would have caused errors if we were scanning them: https://github.com/caddyserver/caddy-docker/blob/62cc558526168fc627b898a3c548c20ce995a8d2/Dockerfile.tmpl

do you know if the templating engine is part of the official docker tooling or is it provided by a third party?

> > In the next iteration of the check, would it still be useful to try to parse the templates and just log a message if parsing fails (like you suggest above)?

> I think that would be a better approach. It looks like some of the dockerfiles that we're now skipping (this one for example) are able to be parsed just fine.

ristomcgehee · 2021-12-11T02:51:23Z

> do you know if the templating engine is part of the official docker tooling or is it provided by a third party?

It's third-party tooling.

laurentsimon · 2022-02-02T16:25:11Z

How about creating an Error() function for logging, in addition to Warn() and Info()? It would allow logging the error and skipping minor parsing issues without returning an error for the entire check. Wdyt?
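A rough Go sketch of what such an Error() level might look like next to Warn() and Info(). Everything here is an assumption made for illustration: the `logger`, `detail`, and `subCheck` types, the `runCheck` function, and the 0 to 10 scoring scale are hypothetical and do not reflect scorecard's real logging interface. Only the -1 inconclusive value comes from the description at the top of this issue.

```go
package main

import (
	"errors"
	"fmt"
)

// inconclusive mirrors the -1 result mentioned in the issue description.
const inconclusive = -1

// detail is one logged message with its severity level.
type detail struct {
	Level string
	Msg   string
}

// logger is a hypothetical collector with Info, Warn and the proposed Error.
type logger struct{ Details []detail }

func (l *logger) Info(msg string)  { l.Details = append(l.Details, detail{"INFO", msg}) }
func (l *logger) Warn(msg string)  { l.Details = append(l.Details, detail{"WARN", msg}) }
func (l *logger) Error(msg string) { l.Details = append(l.Details, detail{"ERROR", msg}) }

// subCheck is a hypothetical unit of work inside a larger check.
type subCheck struct {
	Name string
	Run  func() (bool, error)
}

// errUnparseable is an example of a minor, non-fatal sub-check error.
var errUnparseable = errors.New("unparseable file")

// runCheck scores only the sub-checks that completed. A sub-check error is
// logged with Error() and excluded from the score instead of aborting.
func runCheck(l *logger, subs []subCheck) int {
	passed, completed := 0, 0
	for _, s := range subs {
		ok, err := s.Run()
		switch {
		case err != nil:
			l.Error(fmt.Sprintf("%s: %v", s.Name, err))
		case ok:
			completed++
			passed++
			l.Info(s.Name + ": ok")
		default:
			completed++
			l.Warn(s.Name + ": not satisfied")
		}
	}
	if completed == 0 {
		return inconclusive // nothing could be evaluated at all
	}
	return 10 * passed / completed // assumed 0..10 scale
}

func main() {
	l := &logger{}
	score := runCheck(l, []subCheck{
		{Name: "dockerfile", Run: func() (bool, error) { return true, nil }},
		{Name: "shell", Run: func() (bool, error) { return false, errUnparseable }},
	})
	fmt.Println("score:", score)
	for _, d := range l.Details {
		fmt.Println(d.Level, d.Msg)
	}
}
```

In this sketch the whole check only turns inconclusive when every sub-check errors out, which is one possible policy among several.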

ristomcgehee · 2022-02-03T04:24:29Z

Yeah, I think that's a good approach.

laurentsimon · 2022-02-03T16:40:45Z

Added to discussion for next sync.

github-actions · 2023-11-04T01:46:19Z

This issue is stale because it has been open for 60 days with no activity.

laurentsimon added the kind/enhancement New feature or request label Nov 22, 2021

laurentsimon changed the title ~~Feature: more robust way of logging errors in checks~~ Feature: more robust way of logging errors in sub-checks Nov 22, 2021

laurentsimon added the needs discussion label Feb 3, 2022

justaugustus added this to Backlog in Scorecard via automation Feb 23, 2022

github-actions bot added the Stale label Nov 4, 2023

spencerschrock mentioned this issue Nov 6, 2023

🐛 Pinned-Dependencies continues on error #3515

Merged

