-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
need exactly one. Specify --ksize or --dna/--protein. #317
Comments
Hi Mick, this looks like an old version of sourmash, so I'm not sure what
bugs might be lurking. It looks correct to me, though.
In the new version:
sourmash search AF-H4_S10_L001.sig genbank-k31.sbt.json
should work.
It's possible that you are using an old version of sourmash with a new
version of the SBT, for example.
If you send me the .sig I can take a look at it to see if anything looks
weird.
best,
--titus
|
(if you just want to get it to work without doing anything else, the
only suggestion I have is to try adding '--dna' to the command)
|
Hey Titus Yeah we tried --dna and it didn't work For the record, which installation instructions should we follow to get the latest stable version? Cheers |
This tutorial:
https://sourmash.readthedocs.io/en/latest/tutorials.html
will install the latest master branch, which should be as stable as it gets
until we do a 2.0 release.
best,
--titus
|
Closing: I think the issue here was a version issue and should be resolved by installing the latest version. Will be addressed by doing "unstable" releases (see #324), also. |
I seem to be getting this same error with with Command:
Output:
Adding Any ideas on what's going wrong? |
hi @nick-youngblut how was the bins_DASTool.sig computed? there's no simple way for sourmash to figure out which of the 6 signatures that are in there is the one to run gather on; for the moment the fix would be to put the signatures in individual files. Kind of a gap in the sourmash tool chain - eventually we'd like to be able to provide a signature name or md5 sum on the command line, but it's not there yet. |
Thanks for the quick response! I computed bins_DASTool.sig with the following:
The input was 6 fasta-formated metagenome bins. |
On Sat, Feb 10, 2018 at 02:52:03PM +0000, Nick Youngblut wrote:
Thanks for the quick response! I computed bins_DASTool.sig with the following:
```
sourmash compute --scaled 10000 -k 31 -o tests/output_amy/sourmash/bins_DASTool.sig tests/output_amy/bin_DASTool_bins/*.fa
```
The input was 6 fasta-formated metagenome bins.
OK! For now, I'd suggest just omitting the -o and it will produce 6 .fa.sig
files in the current directory; then run gather on each of those.
If you want to combine all six into one, you can either do
cat *.fa | sourmash compute - -o output.sig -k 31 --scaled 10000
or (I think)
sourmash compute *.fa --merged=someNameHere -o output.sig -k 31 --scaled 10000
…-- I can check on that if you're interested but I'm on a loaner laptop right
now :)
|
Great! Yeah, I'm trying to avoid multiple input, multiple output commands, which complicates I'll give the bin-file merging options a try. Will I still be able to differentiate the bins after merging, or will I just get a full list of taxonomic classifications for the entire merged set of bins? |
On Sat, Feb 10, 2018 at 02:59:22PM +0000, Nick Youngblut wrote:
Great! Yeah, I'm trying to avoid multiple input, multiple output commands, which complicates `snakemake` pipelines.
Ahh, I see. Apologies ;)
I'll give the bin-file merging options a try. Will I still be able to differentiate the bins after merging, or will I just get a full list of taxonomic classifications for the entire merged set of bins?
The latter, I'm afraid!
|
hi - i m having the exact same problem when trying to classify multiple contigs all in a single file Output: select query k=31 automatically. Is the solution to split in individual files and use gather? like mentioned above? |
yep! you can now use |
Hi
I am not sure exactly what this means.
I created a sig from reads using --scaled 10000 and -k 31
I then downloaded the k31 genbank database.
Now when I run:
I get:
What's happening? Same thing with sbt_gather
Perhaps as importantly, what command should I be running to compare reads from a metagenome against GenBank?
Cheers
Mick
The text was updated successfully, but these errors were encountered: