-
Notifications
You must be signed in to change notification settings - Fork 202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expected files not created when seqTaxDB is made from existing BLAST database #401
Comments
Sorry I didn't update the documentation. The |
I see that makes sense. However, the resultDB from running a search against my nrDB is still an alignment database, and not a taxonomy database, so I cannot use the Edit: actually it looks like the
But no |
I think you are nearly there, you just have to swap the order or |
Thank you for the suggestion. I tried it and here is the log file:
Edit: also the usage for
So according to this, I believe that nrDB should go second? |
What command did you use to create the
|
I used the following command, where
Only three files were created as the result:
If I do
If I do
I also tried searching against my
I see that makes sense about Edit: also I don't know if I mentioned that I am using |
Usually you would combine the
You could also assign one taxonomic label per search result by using either the
Doing this would however mean that you are not using our 2bLCA procedure that gives a stable label. Especially in combination with a profile search I would expect this to result in many labels very close to the root node. EDIT: I don't you are missing anything important that happened since that commit you used. |
I tried using the
However, I get the following warning:
Is it not possible to generate a taxonomy report from
but my resultDB from searching against a |
You have to supply a sequence database (e.g. That sentence is also not very precise. You cannot use |
I see. So when the documentation says
"searching" does not refer to Here is the work-around I used previously to see the taxonomy of my search results. Does this look pretty normal to you, or is there a better way to do this? There seems to be no good way to go from an alignment database (search result) to a sequence database which can then can be used in the
|
Let me summarize: For a search result you want either a Kraken or Krona style report of all NR targets found for all queries? |
Yes that is correct. I originally thought that I could use |
That use-case makes sense, I think I'll implement it soonish (so that you can do that directly with
|
Okay sorry, this will not work after all, as |
Wow thank you so much!! I will keep an eye out for it. And thanks for answering my questions. |
I can make a pull request with changes to the documentation based on our discussion if you'd like. Otherwise feel free to close this issue. |
I think this should do what you want. Especially the Krona plots might get a bit overwhelming though (due to the potentially large number of alignments contributing to it). Anyone should be able to contribute to the wiki, I think you can just edit it. |
Amazing! I would install by compiling source, but my cluster only has g++ v9.8.5. Are the static linux versions updated? I just tried reinstalling with
but the changes aren't reflected there yet. |
The static binaries take about one hour to build (running the tests take some time). |
I got the latest version. The new |
First of all, I must thank all the MMseqs contributers for all the excellent documentation and support! The wiki has been extremely helpful.
Problem: Expected seqTaxDB
.dmp
files not createdI followed the instructions in the wiki for creating a seqTaxDB from an existing BLAST database. I am using the NR database.
The documentation says that the following files should be created:
However, am missing several of these files. Here are the files created. Notably, all the
.dmp
files are not being created.I know that this is at least a valid aminoacid database since I an search against it and get hits. However, I cannot use the
taxonomyreport
command on the results since it tells me that the result is an alignment database and not a taxonomy database. Similarily, when I run thetaxonomyreport
command with the nrDB as the result and seqTaxDB, it tells me that nrDB is an aminoacid database.I tried copying the
.dmp
files from the downloaded taxonomy into the same folder as my database, and renaming them tonrDB_merged.dmp
,nrDB_names.dmp
, andnrDB_nodes.dmp
. My database is still not being recognized as a taxonomy database though.createdb log file
createtaxdb log file
Thanks for taking the time to look at this!
The text was updated successfully, but these errors were encountered: