centrifuge-download error extra operand '.gz' #201
Comments
It feels like the file assembly_summary.txt or assembly_summary_filtered.txt is wrong (missing some columns, or some tabs have become spaces). Does the same issue happen in your separate job? |
Same issue if run separately, but it's only occurring with bacteria and only with 5 genomes; it works fine with viral, fungi and archaea. |
Any updates on this? Colleagues are having the same issue when trying to download bacterial genomes. |
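One way to test the "missing columns or tabs turned into spaces" idea is to count the tab-separated fields on every row of the summary file. The sketch below is only an illustration: `field_count_histogram` is a hypothetical helper name, and the three sample rows are made up rather than taken from a real assembly_summary.txt. If the output has more than one line, the rows do not all have the same number of columns.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print "<field count><TAB><number of rows>" for every
# distinct field count in a tab-separated file, skipping '#' comment lines.
field_count_histogram() {
    awk -F'\t' '!/^#/ { counts[NF]++ }
                END { for (n in counts) printf "%d\t%d\n", n, counts[n] }' "$1" | sort -n
}

# Made-up sample: two well-formed rows and one where a tab became a space.
sample=$(mktemp)
printf 'GCF_1\t562\thttps://example.org/a\n'  > "$sample"
printf 'GCF_2\t573\thttps://example.org/b\n' >> "$sample"
printf 'GCF_3\t654 https://example.org/c\n'  >> "$sample"

field_count_histogram "$sample"
rm -f "$sample"
```

To check a real download, call the function on the summary file itself, for example `field_count_histogram assembly_summary_filtered.txt`.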
I could not reproduce this error on our server. What is the bash version on your system? |
@stephaniepillay @mourisl No, we didn't solve the issue. The workaround was to change the order of the download so that bacteria came last on the list; the job would then run, accepting that those few sequences would not download. For me it was 5 sequences, which didn't seem too much of an issue in the grander scheme of the bacterial sequences, but others had around 50 that failed. These individuals have repeated the download step for bacteria several times and this number reduced. |
@mourisl bash, version 4.2.46 |
I'm getting this exact same issue with "Error downloading na/654 na_genomic.fna.gz!" This then crashes the rest of the build. Bash version: 4.2.46(2)-release. Did anyone ever find a solution? If not, is there a recommended workaround? |
I have the same error and am looking for a solution. |
Hi everyone, In case this is still an issue for some of you, the problem seems to be similar to #221 which has been solved by @mourisl in commit a5c09bb29a3a828d88be49c55353cd84b6b9bbad but only for the viral database. So I solved this issue by downloading the updated centrifuge-download and changing @mourisl It seems that the patch actually works for all domains since it handles both cases (field 20 or 21) so the "if" condition seems unnecessary to me. By the way, the line That's all, I hope this will be helpful. |
@gbikpi Thanks for testing! I will update the script and merge it to the master. |
The patch is merged to the master branch. Now all the domains will use the (maybe) more robust parsing strategy. |
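For readers wondering what "handles both cases (field 20 or 21)" could look like in practice, here is a minimal sketch of one position-independent approach: scan each row for the first field that looks like a URL instead of relying on a fixed column number. This is not the code from the merged patch; `find_url_field` is a hypothetical name and the sample rows are made up and much shorter than real summary rows.

```shell
#!/usr/bin/env bash
# Hypothetical helper: for each non-comment row of a tab-separated file, print
# the first field that starts with ftp://, http:// or https://.
# Rows without such a field are reported on stderr with their line number.
find_url_field() {
    awk -F'\t' '!/^#/ {
        for (i = 1; i <= NF; i++)
            if ($i ~ /^(ftp|https?):\/\//) { print $i; next }
        print "line " NR ": no URL field found" > "/dev/stderr"
    }' "$1"
}

# Made-up sample: the URL sits in column 3 on one row and column 4 on the other.
sample=$(mktemp)
printf 'GCF_1\t562\tftp://example.org/a\tna\n'     > "$sample"
printf 'GCF_2\t573\textra\thttps://example.org/b\n' >> "$sample"

find_url_field "$sample"
rm -f "$sample"
```

Matching on content rather than position means an extra column in the summary file does not shift the path into the wrong variable, at the cost of assuming no earlier column ever contains a URL.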
Thanks for updating,
|
Hello, I can confirm there's still the same error for viral genomes. |
We recently went through downloading/building an index again for a new student, and a few of the bacterial genomes failed (20 didn't download). This time we also had the issue everyone else was having with the viral genomes, with the download failing completely. |
I am having exactly the same issue as @oatesa describes. Is there a workaround possible? |
Note: no error message is displayed for not downloading the last 20 bacterial genomes |
Hello, I also encountered the same error while downloading the virus genome especially. As mentioned above, I replaced the |
Hello, I tried the fix listed by others, using the updated curl v 7.82.0. Hope this is solvable. |
I'm having the same problem when running:
. I'm using centrifuge/1.0.4. Looking at the script
I think the problem is in this
Is looking for a string with
So I changed the command to search for |
We recently decided to update our index, so we started from scratch (deleting the old/dated index etc.).
We ran centrifuge-download -o library -m -d "archaea,bacteria,viral,fungi" refseq >> seqid2taxid.map. Archaea was successful, but we received errors with bacteria
4247/19206basename: extra operand '.gz'
Try 'basename --help' for more information.
Error downloading na/562 na_genomic.fna.gz!
basename: extra operand '.gz'
Try 'basename --help' for more information.
Overall this relates to 5 genomes (it stopped 5 short of the total), and the job did not progress to the viral or fungi download. I have run these as a separate job (currently running) but wondered what this error could relate to and how to correct it.
Thanks in advance
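For context on the message itself: GNU `basename` accepts a name and at most one suffix, so "extra operand '.gz'" means it received one argument too many. That typically happens when an unquoted shell variable contains a space and is split into two words. The sketch below reproduces the message with a made-up value modelled on the log above. Whether this is the exact line that fails inside centrifuge-download is an assumption, and the behaviour shown is that of GNU coreutils.

```shell
#!/usr/bin/env bash
# Made-up value imitating a path that contains an unexpected space.
path="na/562 na_genomic.fna.gz"

# Unquoted: the shell splits $path into two words, so basename receives three
# operands (na/562, na_genomic.fna.gz, .gz) and GNU basename rejects the third.
basename $path .gz || echo "unquoted call failed"

# Quoted: basename receives exactly two operands (NAME and SUFFIX) and succeeds.
basename "$path" .gz
```

Quoting only hides the symptom here. The "na" in the path suggests the wrong column was read in the first place, so the parsing of the summary file is the part that needs fixing.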