You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ctb
changed the title
creating a signature by downloading and sketching a genome sequence
create a signature by downloading and sketching a genome sequence
May 12, 2022
first, download a genome:
This will create a 1.4MB file
GCF_000005845.2_ASM584v2_genomic.fna.gz
containing an E. coli K-12 genome for strain MG1655 (see Genbank entry).Next, calculate the signature using
sourmash sketch dna
:here, the
-p abund
tellssourmash sketch
to also retain the abundance (frequency) information for k-mers.This will produce a signature file,
GCF_000005845.2_ASM584v2_genomic.fna.gz.sig
, that is much smaller than the original genome file (86k vs 1.4 MB).You can view the metadata properties of this signature with
sourmash sig describe
:This example was taken from Large scale sequence comparisons with sourmash, Pierce et al., 2019.
The text was updated successfully, but these errors were encountered: