-
Notifications
You must be signed in to change notification settings - Fork 309
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
make reading fasta less memory hungry #1458
Comments
Is this on a single node? What are you using for |
Yes, it is signle node with default fragment_length |
Any progress on this? |
Sorry, no major news here. We've been backlogged with the 0.23.0 release and Python/R API additions. How are you using the output of the conversion? If you are using it for a broadcast object via the ReferenceFile API, you may want to consider the TwoBitFile implementation, which is much more efficient from a memory capacity perspective. |
I'll take a look at this soon though; we added sequence dictionary loading code that may eliminate the need for some of the code and caching that is in the FASTA load path. |
Fixed by #2175 |
When I download latest human genome release and convert fasta to adam it eats >22 Gb memory (for 3.2 GB) file. It would be nice to optimize it somehow in terms of memory consumption.
The text was updated successfully, but these errors were encountered: