Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kallisto quantification #1106

Merged
merged 66 commits into from
Nov 15, 2023
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
66 commits
Select commit Hold shift + click to select a range
0979aee
naive kallisto addition to prepare_genome, following salmon pattern (…
pinin4fjords Oct 31, 2023
882843b
Add kallisto modules
pinin4fjords Oct 31, 2023
952ab48
update schema
pinin4fjords Oct 31, 2023
ac4fa24
Pass kallisto index in workflow
pinin4fjords Oct 31, 2023
549cfef
Add kallisto quant subworkflow
pinin4fjords Oct 31, 2023
b32eba6
[skip ci] next batch of kallisto adaptations
pinin4fjords Oct 31, 2023
06f72dc
tximport is more generic than just salmon
pinin4fjords Nov 1, 2023
9e3d589
Fix kallisto indexing
pinin4fjords Nov 1, 2023
1a41fbf
[skip ci] Next phase of Kallisto integration
pinin4fjords Nov 1, 2023
c081697
[skip ci] Bump kallisto modules
pinin4fjords Nov 2, 2023
7eb755b
[skip ci] rewire for updated kallisto modules
pinin4fjords Nov 2, 2023
42cef1d
[skip ci] Make tx2gene.py generic
pinin4fjords Nov 2, 2023
4089972
[skip ci] Try chatgpt's tidied version of tx2gene.py
pinin4fjords Nov 2, 2023
d18c3f1
[skip ci] Tidied up and genericised tximport script
pinin4fjords Nov 2, 2023
2d3e08e
[skip ci] misc compatibility updates
pinin4fjords Nov 2, 2023
5be6a04
[skip ci] fix bin/tx2gene.py
pinin4fjords Nov 2, 2023
4d8911f
[skip ci] prioritise transcript_id
pinin4fjords Nov 2, 2023
a5b27f7
[skip ci] final kallisto results handling fixes
pinin4fjords Nov 2, 2023
73e1a54
more genericisation between kallisto and salmon
pinin4fjords Nov 3, 2023
2a24792
Remove debugging line
pinin4fjords Nov 3, 2023
aebad06
Fix multiqc to avoid imp module error
pinin4fjords Nov 3, 2023
2ada1e4
Fix process selector
pinin4fjords Nov 3, 2023
39589a6
add kallisto_index default
pinin4fjords Nov 3, 2023
839ac5c
Combine kallisto and salmon components
pinin4fjords Nov 3, 2023
aa6e8c0
appease eclint
pinin4fjords Nov 3, 2023
72766a5
[automated] Fix linting with Prettier
nf-core-bot Nov 3, 2023
7328c65
Update CHANGELOG
pinin4fjords Nov 3, 2023
3c56b68
Merge branch 'kallisto_quant' of https://github.com/nf-core/rnaseq in…
pinin4fjords Nov 3, 2023
6a5e485
blackify
pinin4fjords Nov 3, 2023
f40065d
Don't hardcode tx2gene_file
pinin4fjords Nov 3, 2023
943e6ab
syntax fix
pinin4fjords Nov 3, 2023
83217d3
Add some kallisto tests
pinin4fjords Nov 3, 2023
6f85da7
Update metro maps
pinin4fjords Nov 3, 2023
7bda41e
[skip ci] Fix metro maps
pinin4fjords Nov 3, 2023
6479792
Update strandedness params for kallisto and salmon
pinin4fjords Nov 3, 2023
92716b4
[skip ci] appease linter
pinin4fjords Nov 3, 2023
cadec09
Fix kallisto publishing
pinin4fjords Nov 3, 2023
908c6f2
[skip ci] fix usage.md for Kallisto
pinin4fjords Nov 3, 2023
334088e
Merge branch 'kallisto_quant' of github.com:nf-core/rnaseq into kalli…
pinin4fjords Nov 3, 2023
0fc5407
[automated] Fix linting with Prettier
nf-core-bot Nov 3, 2023
e9ed01d
Update fraglen defaults
pinin4fjords Nov 3, 2023
1467c81
MultiQC needs STDOUT from Kallisto
pinin4fjords Nov 6, 2023
7d04886
Fix Kallisto MultiQC
pinin4fjords Nov 6, 2023
8420e99
Update docs
pinin4fjords Nov 6, 2023
46866ae
[automated] Fix linting with Prettier
nf-core-bot Nov 6, 2023
b39e7e8
Correct kallisto out path
pinin4fjords Nov 6, 2023
36694f8
Merge branch 'kallisto_quant' of github.com:nf-core/rnaseq into kalli…
pinin4fjords Nov 6, 2023
58bbc27
Add prefix usage to local modules to facilitate batched runs
pinin4fjords Nov 7, 2023
ab4c662
[skip ci] Address easy review feedback
pinin4fjords Nov 7, 2023
c612dd0
[skip ci] Update config per latest
pinin4fjords Nov 9, 2023
4a81e64
[skip ci] fix font sizes in metro map file icons
pinin4fjords Nov 9, 2023
d5c6f7f
thank myself to poke ci
pinin4fjords Nov 9, 2023
d75d507
Reflect updated kallisto module
pinin4fjords Nov 9, 2023
b0cf295
[skip ci] test file not supposed to be there
pinin4fjords Nov 10, 2023
9dcaf0b
Merge branch 'dev' into kallisto_quant
pinin4fjords Nov 10, 2023
4f556ee
[skip ci] Add Kallisto outputs to Tower yml
pinin4fjords Nov 13, 2023
9ab9f7c
Post-review changes
drpatelh Nov 14, 2023
0c9c0aa
Fix Kallisto sample name rendering in MultiQC
drpatelh Nov 14, 2023
2decfdc
[skip ci] Alphabetically order enum fields for salmon strandedness
drpatelh Nov 14, 2023
5ea79ac
[skip ci] Bump module from branch
pinin4fjords Nov 14, 2023
9605b0f
Merge branch 'kallisto_quant' of github.com:nf-core/rnaseq into kalli…
pinin4fjords Nov 14, 2023
83d9fb4
[automated] Fix linting with Prettier
nf-core-bot Nov 14, 2023
6c75322
[skip ci] removed kallisto strandedness as a standalone parameter
pinin4fjords Nov 14, 2023
5458b5c
[skip ci] Fix kallisto log file ext
pinin4fjords Nov 14, 2023
68b8977
Fix tiny issue
pinin4fjords Nov 14, 2023
5ad263a
[automated] Fix linting with Prettier
nf-core-bot Nov 14, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,10 +10,10 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
Special thanks to the following for their contributions to the release:

- [Adam Talbot](https://github.com/adamrtalbot)
- [Jonathan Manning](https://github.com/pinin4fjords)
- [Júlia Mir Pedrol](https://github.com/mirpedrol)
- [Matthias Zepper](https://github.com/MatthiasZepper)
- [Maxime Garcia](https://github.com/maxulysse)
- [Jonathan Manning](https://github.com/pinin4fjords)

Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.

Expand All @@ -38,7 +38,7 @@ Thank you to everyone else that has contributed by reporting bugs, enhancements
| Dependency | Old version | New version |
| ----------------------- | ----------- | ----------- |
| `fastqc` | 0.11.9 | 0.12.1 |
| `multiqc` | 1.14 | 1.15 |
| `multiqc` | 1.14 | 1.17 |
| `ucsc-bedgraphtobigwig` | 377 | 445 |

> **NB:** Dependency has been **updated** if both old and new version information is present.
Expand All @@ -65,7 +65,7 @@ Thank you to everyone else that has contributed by reporting bugs, enhancements
### Enhancements & fixes

- [[#1011](https://github.com/nf-core/rnaseq/issues/1011)] - FastQ files from UMI-tools not being passed to fastp
- [[#1018](https://github.com/nf-core/rnaseq/issues/1018)] - Ability to skip both alignment and pseudo-alignment to only run pre-processing QC steps.
- [[#1018](https://github.com/nf-core/rnaseq/issues/1018)] - Ability to skip both alignment and pseudoalignment to only run pre-processing QC steps.
- [PR #1016](https://github.com/nf-core/rnaseq/pull/1016) - Updated pipeline template to [nf-core/tools 2.8](https://github.com/nf-core/tools/releases/tag/2.8)
- [PR #1025](https://github.com/nf-core/fetchngs/pull/1025) - Add `public_aws_ecr.config` to source mulled containers when using `public.ecr.aws` Docker Biocontainer registry
- [PR #1038](https://github.com/nf-core/rnaseq/pull/1038) - Updated error log for count values when supplying `--additional_fasta`
Expand Down Expand Up @@ -813,7 +813,7 @@ Major novel changes include:
- Added options to skip several steps
- Skip trimming using `--skipTrimming`
- Skip BiotypeQC using `--skipBiotypeQC`
- Skip Alignment using `--skipAlignment` to only use pseudo-alignment using Salmon
- Skip Alignment using `--skipAlignment` to only use pseudoalignment using Salmon

### Documentation updates

Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@
3. [`dupRadar`](https://bioconductor.org/packages/release/bioc/html/dupRadar.html)
4. [`Preseq`](http://smithlabresearch.org/software/preseq/)
5. [`DESeq2`](https://bioconductor.org/packages/release/bioc/html/DESeq2.html)
15. Pseudo-alignment and quantification ([`Salmon`](https://combine-lab.github.io/salmon/) or ['Kallisto'](https://pachterlab.github.io/kallisto/); _optional_)
15. Pseudoalignment and quantification ([`Salmon`](https://combine-lab.github.io/salmon/) or ['Kallisto'](https://pachterlab.github.io/kallisto/); _optional_)
16. Present QC for raw read, alignment, gene biotype, sample similarity, and strand-specificity checks ([`MultiQC`](http://multiqc.info/), [`R`](https://www.r-project.org/))

> **Note**
Expand Down
2 changes: 1 addition & 1 deletion bin/summarizedexperiment.r
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

library(SummarizedExperiment)

## Create SummarizedExperiment (se) object from Salmon counts
## Create SummarizedExperiment (se) object from counts

args <- commandArgs(trailingOnly = TRUE)
if (length(args) < 3) {
Expand Down
14 changes: 4 additions & 10 deletions conf/modules.config
Original file line number Diff line number Diff line change
Expand Up @@ -705,11 +705,9 @@ if (!params.skip_alignment && params.aligner == 'star_salmon') {
if (!params.skip_qc & !params.skip_deseq2_qc) {
process {
withName: 'DESEQ2_QC_STAR_SALMON' {
ext.prefix = "deseq2"
ext.args = { [
"--id_col 1",
"--sample_suffix ''",
"--outprefix deseq2",
"--count_col 3",
params.deseq2_vst ? '--vst TRUE' : ''
].join(' ').trim() }
Expand Down Expand Up @@ -770,11 +768,9 @@ if (!params.skip_alignment && params.aligner == 'star_rsem') {
if (!params.skip_qc & !params.skip_deseq2_qc) {
process {
withName: 'DESEQ2_QC_RSEM' {
ext.prefix = "deseq2"
ext.args = { [
"--id_col 1",
"--sample_suffix ''",
"--outprefix deseq2",
"--count_col 3",
params.deseq2_vst ? '--vst TRUE' : ''
].join(' ').trim() }
Expand Down Expand Up @@ -1085,10 +1081,10 @@ if (!params.skip_multiqc) {
}

//
// Salmon/ Kallisto pseudo-alignment options
// Salmon/ Kallisto pseudoalignment options
//

if (params.pseudo_aligner == 'salmon') {
if (!params.skip_pseudo_alignment && params.pseudo_aligner == 'salmon') {
process {

withName: '.*:QUANTIFY_PSEUDO_ALIGNMENT:SALMON_QUANT' {
Expand All @@ -1102,15 +1098,14 @@ if (params.pseudo_aligner == 'salmon') {
}
}

if (params.pseudo_aligner == 'kallisto') {
if (!params.skip_pseudo_alignment && params.pseudo_aligner == 'kallisto') {
process {
withName: '.*:QUANTIFY_PSEUDO_ALIGNMENT:KALLISTO_QUANT' {
ext.args = params.extra_kallisto_quant_args ?: ''

publishDir = [
path: { "${params.outdir}/${params.pseudo_aligner}" },
mode: params.publish_dir_mode,
saveAs: { filename -> filename.equals('versions.yml') || filename.endsWith('_run_info.json') ? null : filename }
saveAs: { filename -> filename.equals('versions.yml') || filename.endsWith('.run_info.json') || filename.endsWith('.log.txt') ? null : filename }
]
}
}
Expand Down Expand Up @@ -1150,7 +1145,6 @@ if (!params.skip_pseudo_alignment) {
ext.args = { [
"--id_col 1",
"--sample_suffix ''",
"--outprefix deseq2",
"--count_col 3",
params.deseq2_vst ? '--vst TRUE' : ''
].join(' ').trim() }
Expand Down
16 changes: 8 additions & 8 deletions docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,9 +41,9 @@ The pipeline is built using [Nextflow](https://www.nextflow.io/) and processes d
- [featureCounts](#featurecounts) - Read counting relative to gene biotype
- [DESeq2](#deseq2) - PCA plot and sample pairwise distance heatmap and dendrogram
- [MultiQC](#multiqc) - Present QC for raw reads, alignment, read counting and sample similiarity
- [Pseudo-alignment and quantification](#pseudo-alignment-and-quantification)
- [Salmon](#salmon) - Wicked fast gene and isoform quantification relative to the transcriptome
- [Kallisto](#kallisto) - Near-optimal probabilistic RNA-seq quantification
- [Pseudoalignment and quantification](#pseudoalignment-and-quantification)
- [Salmon](#pseudoalignment) - Wicked fast gene and isoform quantification relative to the transcriptome
- [Kallisto](#pseudoalignment) - Near-optimal probabilistic RNA-seq quantification
Wicked fast gene and isoform quantification relative to the transcriptome
- [Workflow reporting and genomes](#workflow-reporting-and-genomes)
- [Reference genome files](#reference-genome-files) - Saving reference genome indices/files
Expand Down Expand Up @@ -205,7 +205,7 @@ The STAR section of the MultiQC report shows a bar plot with alignment rates: go

![MultiQC - STAR alignment scores plot](images/mqc_star.png)

[Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) from [Ocean Genomics](https://oceangenomics.com/) and [Kallisto](https://pachterlab.github.io/kallisto/), from the Pachter Lab, are provided as options for pseudo-alignment. Both allow quantification of reads against an index generated from a reference set of target transcripts. By default, the transcriptome-level BAM files generated by STAR are provided to Salmon for downstream quantification, and Kallisto is not an option here (it does not allow BAM file input). But you can provide FASTQ files directly as input to either Salmon or Kallisto in order to pseudo-align and quantify your data by providing the `--pseudo_aligner (salmon or kallisto)` parameter. See the [Salmon](#salmon) and (Kallisto)[#kallisto] results sections for more details.
[Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) from [Ocean Genomics](https://oceangenomics.com/) and [Kallisto](https://pachterlab.github.io/kallisto/), from the Pachter Lab, are provided as options for pseudoalignment. Both allow quantification of reads against an index generated from a reference set of target transcripts. By default, the transcriptome-level BAM files generated by STAR are provided to Salmon for downstream quantification, and Kallisto is not an option here (it does not allow BAM file input). But you can provide FASTQ files directly as input to either Salmon or Kallisto in order to pseudoalign and quantify your data by providing the `--pseudo_aligner salmon` or `--pseudo_aligner kallisto` parameter. See the [Salmon](#pseudoalignment) and [Kallisto](#pseudoalignment) results sections for more details.

### STAR via RSEM

Expand Down Expand Up @@ -670,9 +670,9 @@ The plot on the left hand side shows the standard PC plot - notice the variable

Results generated by MultiQC collate pipeline QC from supported tools i.e. FastQC, Cutadapt, SortMeRNA, STAR, RSEM, HISAT2, Salmon, SAMtools, Picard, RSeQC, Qualimap, Preseq and featureCounts. Additionally, various custom content has been added to the report to assess the output of dupRadar, DESeq2 and featureCounts biotypes, and to highlight samples failing a mimimum mapping threshold or those that failed to match the strand-specificity provided in the input samplesheet. The pipeline has special steps which also allow the software versions to be reported in the MultiQC output for future traceability. For more information about how to use MultiQC reports, see <http://multiqc.info>.

## Pseudo-alignment and quantification
## Pseudoalignment and quantification

### Pseudo-alignment
### Pseudoalignment

The principal output files are the same between Salmon and Kallsto:

Expand Down Expand Up @@ -717,10 +717,10 @@ An additional subset of files are distinct to each tool, for Salmon:
- `abundance.h5`: a HDF5 binary file containing run info, abundance esimates, bootstrap estimates, and transcript length information length. This file can be read in by [sleuth](https://pachterlab.github.io/sleuth/about).
- `abundance.tsv`: a plaintext file of the abundance estimates. It does not contains bootstrap estimates.
- `run_info.json`: a json file containing information about the run.
- `<SAMPLE>.log.txt`: standard output from the Kallisto process per sample.
- `kallisto_quant.log`: standard output from the Kallisto process per sample.
</details>

As described in the [STAR and Salmon](#star-and-salmon) section, you can choose to pseudo-align and quantify your data with [Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) or [Kallisto](https://pachterlab.github.io/kallisto/) by providing the `--pseudo_aligner` parameter. By default, Salmon is run in addition to the standard alignment workflow defined by `--aligner`, mainly because it allows you to obtain QC metrics with respect to the genomic alignments. However, you can provide the `--skip_alignment` parameter if you would like to run Salmon or Kallisto in isolation. If Salmon or Kallisto are run in isolation, the outputs mentioned above will be found in a folder named `salmon` or `kallisto`. If Salmon is run alongside STAR, the folder will be named `star_salmon`.
As described in the [STAR and Salmon](#star-and-salmon) section, you can choose to pseudoalign and quantify your data with [Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) or [Kallisto](https://pachterlab.github.io/kallisto/) by providing the `--pseudo_aligner` parameter. By default, Salmon is run in addition to the standard alignment workflow defined by `--aligner`, mainly because it allows you to obtain QC metrics with respect to the genomic alignments. However, you can provide the `--skip_alignment` parameter if you would like to run Salmon or Kallisto in isolation. If Salmon or Kallisto are run in isolation, the outputs mentioned above will be found in a folder named `salmon` or `kallisto`. If Salmon is run alongside STAR, the folder will be named `star_salmon`.

Transcripts with large inferential uncertainty won't be assigned the exact number of reads reproducibly, every time Salmon is run. Read more about this on the [nf-core/rnaseq](https://github.com/nf-core/rnaseq/issues/585) and [salmon](https://github.com/COMBINE-lab/salmon/issues/613) Github repos.

Expand Down
8 changes: 4 additions & 4 deletions docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -65,19 +65,19 @@ An [example samplesheet](../assets/samplesheet.csv) has been provided with the p

By default, the pipeline uses [STAR](https://github.com/alexdobin/STAR) (i.e. `--aligner star_salmon`) to map the raw FastQ reads to the reference genome, project the alignments onto the transcriptome and to perform the downstream BAM-level quantification with [Salmon](https://salmon.readthedocs.io/en/latest/salmon.html). STAR is fast but requires a lot of memory to run, typically around 38GB for the Human GRCh37 reference genome. Since the [RSEM](https://github.com/deweylab/RSEM) (i.e. `--aligner star_rsem`) workflow in the pipeline also uses STAR you should use the [HISAT2](https://ccb.jhu.edu/software/hisat2/index.shtml) aligner (i.e. `--aligner hisat2`) if you have memory limitations.

You also have the option to pseudo-align and quantify your data directly with [Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) or [Kallisto](https://pachterlab.github.io/kallisto/) by specifying `salmon` or `kallisto` to the `--pseudo_aligner` parameter. The selected pseudoaligner will then be run in addition to the standard alignment workflow defined by `--aligner`, mainly because it allows you to obtain QC metrics with respect to the genomic alignments. However, you can provide the `--skip_alignment` parameter if you would like to run Salmon or Kallisto in isolation. By default, the pipeline will use the genome fasta and gtf file to generate the transcripts fasta file, and then to build the Salmon index. You can override these parameters using the `--transcript_fasta` and `--salmon_index` parameters, respectively.
You also have the option to pseudoalign and quantify your data directly with [Salmon](https://salmon.readthedocs.io/en/latest/salmon.html) or [Kallisto](https://pachterlab.github.io/kallisto/) by specifying `salmon` or `kallisto` to the `--pseudo_aligner` parameter. The selected pseudoaligner will then be run in addition to the standard alignment workflow defined by `--aligner`, mainly because it allows you to obtain QC metrics with respect to the genomic alignments. However, you can provide the `--skip_alignment` parameter if you would like to run Salmon or Kallisto in isolation. By default, the pipeline will use the genome fasta and gtf file to generate the transcripts fasta file, and then to build the Salmon index. You can override these parameters using the `--transcript_fasta` and `--salmon_index` parameters, respectively.

The library preparation protocol (library type) used by Salmon quantification is inferred by the pipeline based on the information provided in the samplesheet, however, you can override it using the `--salmon_quant_libtype` parameter. You can find the available options in the [Salmon documentation](https://salmon.readthedocs.io/en/latest/library_type.html). Similarly, strandedness is taken from the sample sheet or calculated automatically, and passed to Kallisto on a per-library basis, but you can apply a global override by setting `kallisto_quant_strandednes` see the [Kallisto documentation](https://pachterlab.github.io/kallisto/manual).
The library preparation protocol (library type) used by Salmon quantification is inferred by the pipeline based on the information provided in the samplesheet, however, you can override it using the `--salmon_quant_libtype` parameter. You can find the available options in the [Salmon documentation](https://salmon.readthedocs.io/en/latest/library_type.html). Similarly, strandedness is taken from the sample sheet or calculated automatically, and passed to Kallisto on a per-library basis, but you can apply a global override by setting `--kallisto_quant_strandedness` see the [Kallisto documentation](https://pachterlab.github.io/kallisto/manual).

When running Salmon in mapping-based mode via `--pseudo_aligner salmon` the entire genome of the organism is used by default for the decoy-aware transcriptome when creating the indices (see second bulleted option in [Salmon documentation](https://salmon.readthedocs.io/en/latest/salmon.html#preparing-transcriptome-indices-mapping-based-mode)).

Two additional parameters `--extra_star_align_args` and `--extra_salmon_quant_args` were added in v3.10 of the pipeline that allow you to append any custom parameters to the STAR align and Salmon quant commands, respectively. Note, the `--seqBias` and `--gcBias` are not provided to Salmon quant by default so you can provide these via `--extra_salmon_quant_args '--seqBias --gcBias'` if required. You can now also supply additional arguments to Kallisto via `--extra_kallisto_quant_args`.

> **NB:** You can use `--skip_alignment --skip_pseudo_alignment` if you only want to run the pre-processing QC steps in the pipeline like FastQ, trimming etc. This will skip alignment, pseudo-alignment and any post-alignment processing steps.
> **NB:** You can use `--skip_alignment --skip_pseudo_alignment` if you only want to run the pre-processing QC steps in the pipeline like FastQ, trimming etc. This will skip alignment, pseudoalignment and any post-alignment processing steps.

## Quantification options

The current options align with STAR and quantify using either Salmon (`--aligner star_salmon`) / RSEM (`--aligner star_rsem`). You also have the option to pseudo-align and quantify your data with Salmon or Kallisto by providing the `--pseudo_aligner salmon` or `--pseudo_aligner kallisto` parameter, respectively.
The current options align with STAR and quantify using either Salmon (`--aligner star_salmon`) / RSEM (`--aligner star_rsem`). You also have the option to pseudoalign and quantify your data with Salmon or Kallisto by providing the `--pseudo_aligner salmon` or `--pseudo_aligner kallisto` parameter, respectively.

Since v3.0 of the pipeline, featureCounts is no longer used to perform gene/transcript quantification, however it is still used to generate QC metrics based on [biotype](http://www.ensembl.org/info/genome/genebuild/biotypes.html) information available within GFF/GTF genome annotation files. This decision was made primarily because of the limitations of featureCounts to appropriately quantify gene expression data. Please see [Zhao et al., 2015](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0141910#pone-0141910-t001) and [Soneson et al., 2015](https://f1000research.com/articles/4-1521/v1).

Expand Down
6 changes: 5 additions & 1 deletion modules/nf-core/kallisto/quant/main.nf

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading
Loading