Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sourmash tax prepare sometimes fails to create db? #2211

Open
bluegenes opened this issue Aug 15, 2022 · 1 comment
Open

sourmash tax prepare sometimes fails to create db? #2211

bluegenes opened this issue Aug 15, 2022 · 1 comment

Comments

@bluegenes
Copy link
Contributor

bluegenes commented Aug 15, 2022

I'm seeing some odd behavior, but not sure if this is server-related or sourmash related.

On a compute node, tax prepare will refuse to prepare a new lineages database, saying the table already exists. After running, a new file of that name exists but is empty.

sourmash tax prepare -t gtdb-rs207.taxonomy.csv -o gtdb-rs207.taxonomy.db

== This is sourmash version 4.4.4.dev2+g2f38f6c2. ==
== Please cite Brown and Irber (2016), doi:10.21105/joss.00027. ==

loading taxonomies...
...loaded 317542 entries.
saving to 'gtdb-rs207.taxonomy.db', format sql...
ERROR while saving!
taxonomy table already exists in 'gtdb-rs207.taxonomy.db'

file: 512 -rw-r--r-- 1 ntpierce ctbrowngrp 0 Aug 15 15:14 gtdb-rs207.taxonomy.db

On the login node, running the exact same command seems to work, even without removing the empty file.

sourmash tax prepare -t gtdb-rs207.taxonomy.csv -o gtdb-rs207.taxonomy.db

== This is sourmash version 4.4.4.dev2+g2f38f6c2. ==
== Please cite Brown and Irber (2016), doi:10.21105/joss.00027. ==

loading taxonomies...
...loaded 317542 entries.
saving to 'gtdb-rs207.taxonomy.db', format sql...
done!

file: 15M -rw-r--r-- 1 ntpierce ctbrowngrp 54M Aug 15 15:17 gtdb-rs207.taxonomy.db

Running the command again once the file is non-empty produces the error seen above.

@ctb
Copy link
Contributor

ctb commented Aug 27, 2022

I can't replicate this on farm nodes c6-58 or bm10 - but I believe it happens ;). I've seen weird things happen with sqlite on the nodes on our HPC. I think it has to do with locking, but that's just a guess.

Next time you run across this problem, could you record the specific node and filesystem you're on? thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants