-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
can we make some of the 'sourmash signature' functions work on streams? #609
Comments
thinking more about this, what I'm really after is chaining (ref). In particular, I'd really like to be able to do something like this:
which would let us watch loss of k-mers etc from these commands. (I'm already using this kind of thing in spacegraphcats to monitor information loss at various steps.) More specifically to this issue, I have visions of
or even more ambitiously
cc @standage |
To be clear, you're suggesting a tube that takes in a stream of sequences, maintains a minhash sketch of incoming k-mers, and spits out each sequence unmodified? Sounds super useful, and (although I know little about how sourmash is implemented) should be straightforward I would think. If you wanted to fully support a continuous data stream, you'd probably want to write the minhash sketch to disk at regular intervals. But I doubt that use case is in high demand just yet. :-) |
see also: sourmash watch, which does essentially this :)
|
ref #587. e.g.
sourmash sig describe
could certainly be made streaming; would be interesting to explore for other commands, as an alternative solution to adding detailed/specific command line parameters as in e.g. #560.The text was updated successfully, but these errors were encountered: