Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handling viral taxon #5

Open
swelbo opened this issue Sep 17, 2024 · 1 comment
Open

Handling viral taxon #5

swelbo opened this issue Sep 17, 2024 · 1 comment

Comments

@swelbo
Copy link

swelbo commented Sep 17, 2024

Hello again!

Hope you are well.

We are getting this error when running virus IDs - tested on 11709 and 2697049.

Error: invalid or unsupported assembly accession:

The accession look a bit strange (see below):

n U01866.1
n Unknown:NC_001450.1/
n NC_001452.1/L06906.1
n NC_001511.1/M34193.1

Our bacterial targets all work well but seems to fail when we try to run viruses. Is this a work in progress or perhaps not a feature?

Cheers!

@haubold
Copy link
Contributor

haubold commented Sep 18, 2024

The intention is, that neighbors can handle viruses like bacteria (and eukaryotes for that matter). So thank you for pointing out these mal-formed viral accessions it returns. I just edited neighbors to straighten them out. All accessions returned by neighbors for your two example taxa can now be parsed by datasets. The program datasets, however, treats viruses differently from bacteria and eukaryotes. So to download the genomes with accessions listed in acc.txt, you'd execute something like

datasets download virus genome accession --inputfile acc.txt --include genome

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants