fix errors in miRDeep2 analysis when reads map to unplaced contigs in $refgenome #100

Daniel-VM · 2021-07-27T08:30:25Z

This pull request attempts to fix a potential corner case in STEP 7.2 miRDeep2:

Some miRNA reads can map to unplaced contigs (e.g. ">chr11_KI270721v1_random") in the reference genome
($refgenome). In that situation, removing "_" from the sequence IDs of $refgenome leads to a mismatch with
the chromosome IDs listed in the *.arf file ($reads_vs_refdb). Example:

    genome_nowhitespace.fa: >chr11KI270721v1random ($refgenome after removing underscore)
    $reads_vs_refdb: chr11_KI270721v1_random

Therefore, miRDeep2.pl cannot find the mapped reference ID from $reads_vs_refdb in the edited reference genome, which causes the following error:

""
Command error:
#Starting miRDeep2
[...]

The mapped reference id chr11_KI270721v1_random from file *_reads_vs_refdb.arf is
    not an id of the genome file genome_nowhitespace.fa
     [...]

""
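To check for this mismatch before running miRDeep2, the IDs on both sides can be compared directly. The following is a minimal sketch, not part of the pipeline: the function names `fasta_ids` and `unmatched_arf_ids` are hypothetical, the read line is made-up toy data, and it assumes the *.arf file is tab-separated with the reference ID in the 6th column (the $6 field mentioned in this PR).

```python
def fasta_ids(lines):
    """Collect sequence IDs (first token after '>') from FASTA lines."""
    ids = set()
    for line in lines:
        if line.startswith(">"):
            tokens = line[1:].split()
            if tokens:
                ids.add(tokens[0])
    return ids


def unmatched_arf_ids(arf_lines, genome_ids):
    """Return ARF reference IDs (6th tab-separated column) absent from genome_ids."""
    missing = set()
    for line in arf_lines:
        fields = line.rstrip("\n").split("\t")
        # Assumption: column 6 (index 5) holds the reference/chromosome ID.
        if len(fields) >= 6 and fields[5] not in genome_ids:
            missing.add(fields[5])
    return missing


# Toy data mirroring the example above; the read itself is invented.
genome = [">chr11KI270721v1random", "ACGTACGT"]
arf = ["read_1\t22\t1\t22\tacgt\tchr11_KI270721v1_random\t22\t100\t121\tacgt\t+\t0\tmmmm"]

print(unmatched_arf_ids(arf, fasta_ids(genome)))
```

An empty result would mean every reference ID in the *.arf file has a matching header in the genome file.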
PROPOSAL:

Avoid removing "_" with awk in STEP 7.2 miRDeep2. In my case, this modification solves the above error.
Alternatively, if "_" is removed from $refgenome, apply the same modification to the $6 field (chromosome ID column) of the *.arf file, so that the chromosome IDs still correspond.
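The second option could look roughly like the sketch below. It is illustrative only and not the change made in this PR: the function name `strip_underscores_in_arf` is hypothetical, the read line is made-up toy data, and it assumes a tab-separated *.arf file whose 6th column is the chromosome ID.

```python
def strip_underscores_in_arf(arf_lines, column=5):
    """Remove '_' from one column (default: the 6th) of tab-separated ARF lines."""
    cleaned = []
    for line in arf_lines:
        fields = line.rstrip("\n").split("\t")
        # Only touch the chromosome ID column; read IDs keep their underscores.
        if len(fields) > column:
            fields[column] = fields[column].replace("_", "")
        cleaned.append("\t".join(fields))
    return cleaned


# Toy ARF line; the read itself is invented.
arf_line = "read_1\t22\t1\t22\tacgt\tchr11_KI270721v1_random\t22\t100\t121\tacgt\t+\t0\tmmmm"
fixed = strip_underscores_in_arf([arf_line])[0].split("\t")

print(fixed[0], fixed[5])
```

Restricting the change to a single column matters because read IDs in the first column may also contain "_" and should stay untouched.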

PR checklist

- This comment contains a description of changes (with reason).
- If you've fixed a bug or added code that should be tested, add tests!
  - If necessary, also make a PR on the nf-core/smrnaseq branch on the nf-core/test-datasets repository.
- Make sure your code lints (nf-core lint .).
- Ensure the test suite passes (nextflow run . -profile test,conda).

KevinMenden and others added 2 commits June 15, 2021 14:44

Merge pull request nf-core#85 from nf-core/dev

03333bf

PR for release 1.1.0

fix errors in miRDeep2 analysis when reads maps to unplaced contigs i…

78d6737

…n refgenome

Daniel-VM changed the title ~~fix errors in miRDeep2 analysis when reads maps to unplaced contigs in $refgenome~~ fix errors in miRDeep2 analysis when reads map to unplaced contigs in $refgenome Jul 27, 2021

ewels merged commit 4b7fbcf into nf-core:dev Aug 5, 2021
