[MRG] add sourmash sig check for comparing picklists and databases #1907

Merged (9 commits) on Mar 30, 2022

ctb commented Mar 29, 2022

Adds the sourmash sig check command to compare picklists and databases.

This PR builds on the manifest-as-database work from #1891. sig check makes it possible to output manifests for the signatures that match a picklist, and to extract the picklist values that did not match anything.
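
To make the matched/unmatched split concrete, here is a minimal sketch of the idea in plain Python. It is not sourmash's implementation and does not use the sourmash API; the `check_picklist` helper, the simplified manifest columns, and the sample names are all made up for illustration.

```python
import csv
import io


def check_picklist(picklist_values, manifest_rows, column="name"):
    """Return (manifest rows matching the picklist, picklist values with no match).

    Hypothetical helper for illustration only; not part of sourmash.
    """
    wanted = set(picklist_values)
    # Keep manifest rows whose value in `column` is on the picklist.
    matched_rows = [row for row in manifest_rows if row[column] in wanted]
    # Anything on the picklist that no manifest row supplied is "missing".
    found = {row[column] for row in matched_rows}
    missing = sorted(wanted - found)
    return matched_rows, missing


# A toy manifest with simplified, illustrative columns.
manifest_csv = """name,md5,ksize
genomeA,aaa111,31
genomeB,bbb222,31
genomeC,ccc333,31
"""
manifest_rows = list(csv.DictReader(io.StringIO(manifest_csv)))
picklist = ["genomeA", "genomeC", "genomeZ"]

matched_rows, missing = check_picklist(picklist, manifest_rows)
print([row["name"] for row in matched_rows])  # rows to write to a new manifest
print(missing)                                # picklist values still to be found
```

The two return values correspond to the two outputs described above: a manifest of matching signatures, and the list of picklist entries that still need to be located.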

sig check is extremely, extraordinarily useful when building large databases for which you have all of the signatures available ...somewhere! See #1671 for background and details.

Addresses #1671.
Closes #1652.

  • add docs for sig check
  • add usage etc for sig check
  • add sig check --fail-if-missing to error exit when missing elements
  • sig check should require manifests on input databases
  • provide shorthand (-m?) on sig check for --save-manifest
  • resolve CTB comment / SBT weirdness
  • test add_to_found
  • test _add_rows
  • test picklist not found
  • test no matches
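
The `--fail-if-missing` and `--save-manifest` items above could be wired up roughly as follows. This is a sketch using only `argparse`, not the code from this PR: the `build_parser` and `exit_status` names are hypothetical, the `-m` shorthand is the tentative one from the checklist, and the exit code of 1 is an assumption.

```python
import argparse


def build_parser():
    # Hypothetical stand-in parser; the real command lives in sourmash's CLI.
    parser = argparse.ArgumentParser(prog="sig-check-sketch")
    parser.add_argument(
        "--fail-if-missing",
        action="store_true",
        help="exit with an error if any picklist value was not found",
    )
    parser.add_argument(
        "-m",
        "--save-manifest",
        metavar="FILE",
        help="write a manifest of the matching signatures to FILE",
    )
    return parser


def exit_status(n_missing, fail_if_missing):
    # Assumption: a non-zero status is used only when the flag is set
    # and at least one picklist value is missing.
    return 1 if (fail_if_missing and n_missing > 0) else 0


args = build_parser().parse_args(["--fail-if-missing", "-m", "matched.csv"])
status = exit_status(n_missing=2, fail_if_missing=args.fail_if_missing)
print(args.save_manifest, status)
```

Keeping the exit-status decision in a small pure function makes the "missing elements" case easy to unit test without invoking the command line.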

ctb changed the title from "[WIP] add sourmash sig check for comparing/diffing picklists and manifests" to "[WIP] add sourmash sig check for comparing picklists and manifests" on Mar 29, 2022
ctb changed the title from "[WIP] add sourmash sig check for comparing picklists and manifests" to "[WIP] add sourmash sig check for comparing picklists and databases" on Mar 29, 2022
@ctb ctb mentioned this pull request Mar 29, 2022
12 tasks
codecov bot commented Mar 29, 2022

Codecov Report

Merging #1907 (a90a1b4) into latest (a78361d) will increase coverage by 0.13%.
The diff coverage is 99.23%.

@@            Coverage Diff             @@
##           latest    #1907      +/-   ##
==========================================
+ Coverage   82.82%   82.96%   +0.13%     
==========================================
  Files         122      123       +1     
  Lines       13254    13365     +111     
  Branches     1789     1808      +19     
==========================================
+ Hits        10978    11088     +110     
  Misses       2013     2013              
- Partials      263      264       +1     
Flag      Coverage Δ
python    90.83% <99.23%> (+0.10%) ⬆️
rust      65.80% <ø> (ø)

Impacted Files                      Coverage Δ
src/sourmash/sourmash_args.py       93.39% <50.00%> (-0.17%) ⬇️
src/sourmash/cli/sig/__init__.py    100.00% <100.00%> (ø)
src/sourmash/cli/sig/check.py       100.00% <100.00%> (ø)
src/sourmash/manifest.py            94.24% <100.00%> (+0.69%) ⬆️
src/sourmash/picklist.py            91.09% <100.00%> (+0.58%) ⬆️
src/sourmash/sig/__main__.py        93.56% <100.00%> (+0.58%) ⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update: a78361d...a90a1b4.

ctb changed the title from "[WIP] add sourmash sig check for comparing picklists and databases" to "[MRG] add sourmash sig check for comparing picklists and databases" on Mar 29, 2022
ctb commented Mar 29, 2022

ready for review & merge @sourmash-bio/devs

bluegenes left a comment

🎉 ! Just the one concern about rebuilding the manifest, really

src/sourmash/sig/__main__.py (2 review threads, both resolved)
ctb merged commit 3b267fe into latest on Mar 30, 2022
ctb deleted the add/sig_check branch on March 30, 2022, 00:51
ctb commented Mar 30, 2022

🎉

Successfully merging this pull request may close these issues.

rework database construction and release process to use manifests