Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How does "gather" work? #296

Closed
phiweger opened this issue Jul 25, 2017 · 3 comments
Closed

How does "gather" work? #296

phiweger opened this issue Jul 25, 2017 · 3 comments

Comments

@phiweger
Copy link

phiweger commented Jul 25, 2017

Looking into the code for sourmash gather I built the following intuition about how it works algorithmically, and would like to know if I am correct:

Given a query and a database, it finds the best match. Then, the corresponding hashes/ k-mers are removed from the query and we repeat this until either no query is left or no matches are found.

Does this mean that if two organisms with more or less identical copies of x % of their genome (such as closely related strains) are present in a metagenomic sample, gather will report only one of the strains and not the other?

Thanks.

@ctb
Copy link
Contributor

ctb commented Jul 25, 2017 via email

@ctb
Copy link
Contributor

ctb commented Feb 18, 2018

This is partly addressed in #393, and will be addressed further in documentation for 2.0.

@ctb
Copy link
Contributor

ctb commented May 3, 2020

@ctb ctb closed this as completed May 3, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants