BulkRNAseq provides a simplified pipeline to analyze bulk RNAseq data.
The script consists mainly of :
- "kallisto quant" for running Kallisto quantification of transcript abundance
- "sleuthAnalysis.R" for running Sleuth differential analysis
- "combat.R" for running ComBat batch correction
- "ggbiplot.R" for running principal component analysis
Inputs (placed in the same directory as the source codes) : skip to step #3 if the expression marix is already available
- Raw sequence fastq files ".fastq.gz"
- Specify fragment length and its uncertainty, number of bootsraps and threads
- Samples covariates and batches information in a file named "model_matrix.txt"
- Expression matrix in ".txt" with column as samples x row as transcripts or genes (filename must contain the word "expression")
Get the pipeline run directly on terminal by : " source BulkRNAseq.sh "
References and installations (including system requirements and dependencies) :
Kallisto -> https://pachterlab.github.io/kallisto/
Sleuth -> https://pachterlab.github.io/sleuth/
ComBat -> https://www.bu.edu/jlab/wp-assets/ComBat/Abstract.html
ggbiplot -> https://github.com/vqv/ggbiplot