Skip to content

Snakemake pipeline for running automated RNAseq analysis, from fastq reads to Differential Expression (DE) analysis. The latest version includes rules for post-DE annotation of genes, including annotation and/or orthology results.

Notifications You must be signed in to change notification settings

JasonCharamis/ARuS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ARuS (Automated RNAseq analysis using Snakemake)

This is a fully automated Snakemake pipeline for streamlining end-to-end RNAseq analysis, from fastq reads to DE analysis.

To use as a Docker container, run:

  1. git clone https://github.com/JasonCharamis/ARuS.git
  2. cd ARuS/workflow/ && sudo docker build -t automated_rnaseq_analysis:latest .
  3. sudo docker run -it -v $(pwd):/mnt/workdir -w /mnt/workdir automated_rnaseq_analysis:latest snakemake --snakefile ARuS/workflow/Snakefile --cores 1 --use-conda --conda-frontend mamba

Usage: Wildcard for sample identification is "{sample}_1.fastq.gz" and "{sample}_2.fastq.gz".

The pipeline is designed for 150-bp paired-end Illumina reads and it includes:

  1. Read quality control (QC) and adapter-trimming
  2. Mapping of reads against provided genome sequence
  3. Assign mapped reads to genes - this step also computes TPM values and uses them to produce a PCA plot
  4. Differential expression (DE) analysis using edgeR
  5. Post-DE annotation of DE genes and optionally combine with orthology results

Dependencies:

  1. FASTQC https://github.com/s-andrews/FastQC

  2. Trimommatic https://github.com/usadellab/Trimmomatic

  3. STAR https://github.com/alexdobin/STAR

  4. featureCounts https://github.com/torkian/subread-1.6.1

  5. edgeR https://bioconductor.org/packages/release/bioc/html/edgeR.html

  6. Trinity-bundled Perl scripts for DE analysis using edgeR https://github.com/trinityrnaseq/trinityrnaseq

If you use it on hisat2-mapping mode, you will also need:

  1. hisat2 https://github.com/DaehwanKimLab/hisat2

  2. samtools https://github.com/samtools/samtools

Every dependency is automatically installed through conda.

About

Snakemake pipeline for running automated RNAseq analysis, from fastq reads to Differential Expression (DE) analysis. The latest version includes rules for post-DE annotation of genes, including annotation and/or orthology results.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published