-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Taxon taxid reassigned with reformat #42
Comments
Yes, they are. They should be merged.
However, 2507530 and 2516889 have the exactly same lineage :( One solution is giving an option to specify the TaxId field for cases where TaxIds are available. |
There are 52 more cases.
more cases
more details
|
Done. Now, for these cases, warning messages are shown, and no data returns.
If TaxIds are available, use
|
Tremendous. Thank you! |
By the way, I submitted |
Hi @standage , any responce from NCBI? Do you have any other issues while using or suggestions? I'd like to release a new version with this improved |
I haven't had any other issues, thanks! NCBI responded with the following.
I didn't point them to this thread, I only mentioned |
I check the latest taxdump files, some were merged while some not.
|
@shenwei356 First of all, thank you very much for creating this great tool! It has been very helpful in my research. If I understood correctly, the warning should only appear, if two lineages are completely identical. However, I also get this warning for two species with the same name and a different lineage. I am using taxonkit 0.80 and the taxdump downloaded today.
produces
But the lineages of the two taxa are not identical:
Is this expected behavior? |
By default, taxonkit reformat find the taxid from the taxon name and name of its parent taxon. Here, it's "Asterina;Asterina gibbosa". If TaxIds are available, use
|
Thank you for your swift reply! That makes sense. Actually, I wasn't aware of that option, but it makes life easier for me. |
Hello, I noticed some unexpected behavior today. When I query and reformat the lineage for taxid 2507530,
taxonkit reformat
re-assigns 2516889 as the taxid in the output (the last taxid in the line).It looks like these may be duplicated, unmerged taxids.
Obviously, we should hope NCBI fixes this in the taxdump soon. But I'm assuming this is not the intended
taxonkit
behavior?Prerequisites
taxonkit version
Describe your issue
The text was updated successfully, but these errors were encountered: