We are in the process of merging this into the main sourmash codebase, so this code now lives under https://github.com/dib-lab/sourmash
Initial pull of this code into sourmash: sourmash-bio/sourmash#424
sourmash core functionality implemented in Rust
sourmash is a command-line tool and Python library for computing MinHash sketches from DNA sequences, comparing them to each other, and plotting the results. This allows you to estimate sequence similarity between even very large data sets quickly and accurately. The core data structure is implemented in C++.
There is a PR in sourmash to replace the C++ core with this implementation (tests passing, yay!).
Another goal is to compile this code to webassembly and use it in the browser. There is a NPM package already available, based on this PR. For an example usage of the NPM package check wort-dnd.
For more details, check luizirber/2018-python-rust for a poster presented at GCCBOSC and SciPy 2018.
This project is licensed under a BSD 3-Clause License.