- Grab the code
$ git clone git@github.com:MGXlab/gradient_virome.git
$ conda create -n gradient_virome --file conda-linux-64.lock
$ conda activate gradient_virome
- Define a tab-separated samplesheet with a header and column names
sample_id
pair_id
: mucosal and luminal paired idgradient
:mucosal
orluminal
(can be anything really)R1
: Path to forward fastq fileR2
: Path to reverse fastq fileR1_MD5
: MD5 numbers of Generated R1 FASTQ filesR2_MD5
: MD5 numbers of Generated R2 FASTQ files- compartment: almond: 0-rhizosphere, 1-bulk soil; sponge: 0-sponge, 1-seawater; gut: 0-lumen, 1-mucosa.
-
Fill in the
config.yaml
based on your needs. -
Execute from within this dir.
-
Datasets:
- almond: https://www.researchgate.net/publication/371360764_Almond_rhizosphere_viral_prokaryotic_and_fungal_communities_differed_significantly_among_four_California_orchards_and_in_comparison_to_bulk_soil_communities
- sponge: https://www-sciencedirect-com.proxy.library.uu.nl/science/article/pii/S1931312819304287#sec4
- mammalian gut: https://www.nature.com/articles/s41564-022-01178-w
# This is a dry run
(gradient_virome)$ snakemake -j 94 --use-conda --conda-frontend mamba -p -n
- Remove
-n
to run it
Short, paired-end sequencing reads of a sample.