Skip to content

Latest commit

 

History

History
36 lines (28 loc) · 1.55 KB

README.md

File metadata and controls

36 lines (28 loc) · 1.55 KB
  • Install environment: conda env create --file environment.yaml
  • Activate environment: conda activate snakemake
  • Run analysis:
cd workflows/{analysis of choice}
snakemake --forceall --cores

Results

csv-tables containing the similarities between all samples are in the workflows/{comparison}/aggregated_sim folder.

Conducted comparisons are:

real world results

Input data is from here

using megago

molecular function

using string based comparison

according to methods in preprint of metaproteomics tool survey

molecular function

random set comparison

GO-terms are sampled from SwissProt to resemble real-world data. Two sets of equal size are drawn n times, repeated for different sample sizes. random sets