scancode-toolkit-31.0.2 returns an unknown-license-reference just before the bzip2-libbzip-2010 text #3080
Comments
@DennisClark this is already fixed in the LicenseDetection branch for the upcoming release: https://github.com/nexB/scancode-toolkit/tree/add-license-detection. Similar to Here the detection rule is New license detection looks like this:
There was also a bug in how we group matches into a LicenseDetection. I have fixed it so that license intros are taken into account during grouping. Here are the scan results for you to look at: Old scan just this issue: New scan just this issue: Old scan entire file: New scan entire file:
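To illustrate the grouping idea, here is a rough sketch. It is not the actual ScanCode code: the `Match` class, its `is_intro` flag, the `group_matches` and `detection_keys` helpers, and the `max_gap` threshold are all hypothetical names and values made up for this example. The point is that an intro match sitting right before a proper license match gets folded into the same detection instead of being reported on its own.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Match:
    """Hypothetical, simplified stand-in for a single license match."""
    license_key: str
    start_line: int
    end_line: int
    # True for introductory text such as "licensed under the following terms:"
    is_intro: bool = False


def group_matches(matches: List[Match], max_gap: int = 4) -> List[List[Match]]:
    """Group matches whose line ranges are at most max_gap lines apart.

    max_gap is an assumed threshold chosen for illustration only.
    """
    groups: List[List[Match]] = []
    for match in sorted(matches, key=lambda m: m.start_line):
        if groups and match.start_line - groups[-1][-1].end_line <= max_gap:
            groups[-1].append(match)
        else:
            groups.append([match])
    return groups


def detection_keys(group: List[Match]) -> List[str]:
    """License keys reported for one group.

    Intro matches are dropped when the group also holds a proper license
    match; a group made only of intros is reported unchanged.
    """
    proper = [m.license_key for m in group if not m.is_intro]
    return proper or [m.license_key for m in group]
```

With an intro match on line 10 and a bzip2-libbzip-2010 match starting on line 12 (made-up line numbers), both land in one group and only bzip2-libbzip-2010 is reported for it.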
I scanned doris-1.1.1-rc03 (available at https://github.com/apache/doris/archive/refs/tags/1.1.1-rc03.tar.gz) using scancode-toolkit-31.0.2. It detected most of the licenses in the rather complex notice (attached) in doris-1.1.1-rc03/dist/LICENSE-dist.txt, but it returns both unknown-license-reference and bzip2-libbzip-2010 for this chunk of text:
See lines 9190 through 9257 in the attached scan results to see both detection instances.
Apparently the "licensed under the following terms:" text snippet misled the scan logic, even though it found the bzip2-libbzip-2010 license correctly right after that. There is no reason to return unknown-license-reference for the introductory sentence, which is primarily to provide clarity to the reader of the file.
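For anyone who wants to look for the same pattern elsewhere in the results, here is a minimal sketch. It assumes each file entry in the JSON output has a `licenses` list whose items carry `key`, `start_line`, and `end_line` fields; that layout should be checked against the actual results file. The helper names and the `max_gap` value are hypothetical.

```python
import json


def find_intro_pairs(licenses, max_gap=2):
    """Return (intro, following) pairs where an unknown-license-reference
    match is followed within max_gap lines by a match for another license.

    `licenses` is a list of dicts with "key", "start_line" and "end_line"
    (assumed field names).
    """
    ordered = sorted(licenses, key=lambda d: d["start_line"])
    pairs = []
    for current, following in zip(ordered, ordered[1:]):
        if (
            current["key"] == "unknown-license-reference"
            and following["key"] != "unknown-license-reference"
            and following["start_line"] - current["end_line"] <= max_gap
        ):
            pairs.append((current, following))
    return pairs


def load_file_licenses(results_path, file_path):
    """Load the license matches for one scanned file.

    Assumes a top-level "files" list whose entries have "path" and "licenses".
    """
    with open(results_path, encoding="utf-8") as handle:
        results = json.load(handle)
    for entry in results.get("files", []):
        if entry.get("path") == file_path:
            return entry.get("licenses", [])
    return []
```

Calling `find_intro_pairs(load_file_licenses(<results file>, <path of LICENSE-dist.txt as recorded in the scan>))` would list each intro sentence that was reported separately from the license text right after it.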
LICENSE-dist.txt.zip
doris-1.1.1-rc03-results.json.zip