Map non noisy ONT #1127

Axze-rgb · 2023-11-08T21:06:48Z

Hello,

I have a question: according to Oxford nanopore their last cells produce very accurate reads. Does "map-ont" still work as the best setting to map those reads? I am asking because the manual still refers to "long noisy reads". Thanks for minimap2 and for your time.

Axze-rgb · 2023-11-10T09:07:08Z

Sorry I hadn't seen issue

#1030 (comment)

So, I understand that Dorado is accounted for now in the map-ont settings?

Thanks for all the work you are doing.
Alex

iiSeymour · 2023-11-10T10:33:06Z

@Axze-rgb dorado aligner has not yet changed any of the index settings and when we do we would like them upstreamed here.

lh3 · 2023-11-10T15:14:54Z

For now, use map-ont. You can try -x map-hifi -w10 (HiFi scoring and k-mer length with more seeds) for Q20 reads but you need to have a way to evaluate whether that gives better results.

I hope I can find some time in the next several months to improve minimap2 a little bit. Along this I will be testing alternative scoring for v14 data.

lh3 · 2023-11-10T15:19:39Z

@iiSeymour When you find more appropriate parameters for aligning Q20 reads, I will be happy to add a new preset for that. This will also save me some time. Thanks!

Checunmily · 2024-01-22T09:40:28Z

For now, use map-ont. You can try -x map-hifi -w10 (HiFi scoring and k-mer length with more seeds) for Q20 reads but you need to have a way to evaluate whether that gives better results.

I hope I can find some time in the next several months to improve minimap2 a little bit. Along this I will be testing alternative scoring for v14 data.

hello, recently I've been dealing with some R10 data and I want to know if there are any plans to make some improvements of minimap2 on ONT R10 in the next few months? Or any new suggestions for R10 data?

iiSeymour · 2024-03-04T18:06:24Z

@lh3 from our internal benchmarking we find speed and downstream accuracy are maximized with -x map-ont -k19 -w 19 -U50,500 -g10k.

Mon3trK · 2024-03-05T03:44:03Z

For now, use map-ont. You can try -x map-hifi -w10 (HiFi scoring and k-mer length with more seeds) for Q20 reads but you need to have a way to evaluate whether that gives better results.

I hope I can find some time in the next several months to improve minimap2 a little bit. Along this I will be testing alternative scoring for v14 data.

Hi @lh3, accuracy of ONT sequencing has advanced a lot from duplex or R10.4 pore. I also wonder if there is any plan for setting different preset for R9 and R10 nanopore? And also different basecallers have significant impact on sequencing accuracy, it seem unappropriate to just mixed in -x map-ont.

lh3 · 2024-03-05T04:26:01Z

from our internal benchmarking we find speed and downstream accuracy are maximized with -x map-ont -k19 -w 19 -U50,500 -g10k.

-x map-hifi is equivalent to -x map-ont -k19 -w 19 -U50,500 -g10k -A1 -B4 -O6,26 -E2,1 -s200. The main difference here is the scoring. How scoring affects the downstream tools? If the map-hifi scoring also works, I can add an alias to map-hifi, something like lr:hq.

also different basecallers have significant impact on sequencing accuracy

That is why it is more appropriate to choose a conservative setting that can give you good results on input of varying quality.

iiSeymour · 2024-03-05T15:00:37Z

If the map-hifi scoring also works

Unfortunately not, the map-hifi scoring leads to both fewer mapped reads (~3%) and small regressions in SNP/INDEL calling. It's possible these regressions could be recovered from new models trained on updated scoring parameters but it seems -x map-ont -k19 -w 19 -U50,500 -g10k is the sweet spot.

* Added the lr:hq preset suggested by Nanopore developers (#1127) * Fixed transition scoring. It did not work with presets. * Cleaned up preset documentation

lh3 · 2024-03-12T00:19:21Z

The next release will have a lr:hq preset for -k19 -w 19 -U50,500 -g10k.

bepoli · 2024-03-13T09:16:44Z

Thanks @lh3 !
I understand that the new preset lr:hq is not meant for spliced alignment.
Should I use the existing preset splice:hq with highly accurate Nanopore cDNA reads? (with average quality >= 20)

lh3 · 2024-03-13T12:58:09Z

Yes

lh3 · 2024-03-13T20:50:29Z

I will hijack the thread and ask a question here: are there public Q20 cDNA-seq data? Perhaps because the SQK-PCS114 kit still at the early-access stage, most cDNA reads in papers were produced with R9 or older kits.

dolittle007 · 2024-03-14T03:49:18Z

Hi @lh3, I have PacBio HiFi Iso-Seq data, should I use the existing preset splice:hq along with the new preset lr:hq, or I can just use -k19 -w 19 -U50,500 -g10k -xsplice -C5 -O6,24 -B4?
Thanks a lot.

jelber2 · 2024-03-14T11:05:36Z

The next release will have a lr:hq preset for -k19 -w 19 -U50,500 -g10k.

Shouldn't it be
-x map-ont -k19 -w 19 -U50,500 -g10k ? According to @iiSeymour

FatYuanBao · 2024-03-21T04:21:02Z

@iiSeymour I noticed the latest Minimap2-2.27 (r1193) includes an updated lr:hq preset. I conducted a small benchmark between this new preset and the old map-ont preset on a human R10.4.1 database using dorado 0.4.1 in HAC mode.

For -x map-ont:

19072496 + 0 mapped (99.93% : N/A)
12791592 + 0 primary mapped (99.90% : N/A)

For -x lr:hq:

18636130 + 0 mapped (99.79% : N/A)
12765068 + 0 primary mapped (99.69% : N/A)

It appears that there are fewer mapped reads (~0.14%) with the new lr:hq preset. Considering the relatively high coverage (>50X) of this data, this difference could be significant.

lh3 · 2024-03-21T04:42:32Z

Read count-based metrics are often misleading. The difference mostly comes from short reads and low-quality reads that may interfere with analyses on the contrary. PS: also, not all reads are supposed to get mapped to a reference genome.

dolittle007 · 2024-03-22T17:24:29Z

The next release will have a lr:hq preset for -k19 -w 19 -U50,500 -g10k.

Shouldn't it be -x map-ont -k19 -w 19 -U50,500 -g10k ? According to @iiSeymour

Thanks a lot. @jelber2 splice:hq works for RNA and lr:hq works for DNA.

preset lr:hq => -x map-ont -k19 -w 19 -U50,500 -g10k
preset splice:hq => -x splice -C5 -O6,24 -B4
preset splice => -x map-ont -k15 -w5 --splice -g2k -G200k -U10,1000000 -A1 -B2 -O2,32 -E1,0 -b0 -C9 -z200 -ub --junc-bonus=9 --cap-sw-mem=0 --splice-flank=yes

So parameters from lr:hq and splice:hq will cause conflicts.

camillaugolini-iit · 2024-10-01T10:57:38Z

Hello,

@lh3 and @iiSeymour, as far as I understood, splice:hq is the best option for R10 Nanopore cDNA reads.
Would it be optimal also for the new RNA004 ?
In other words, which setting would you use to optimally align reads from the new RNA pore to a genomic and a transcriptomic reference?

Thank you for your time

camillaugolini-iit · 2024-10-01T11:30:06Z

Also, if provided a --junc-bed file, would this have any conflict with the splice:hq options?

dolittle007 · 2024-10-02T00:55:06Z

@camillaugolini-iit Using the --junc-bed option, minimap2 prioritizes splicing events based on the provided annotations. It will not cause any conflict with splice:hq options.

lh3 added the enhancement label Nov 10, 2023

Axze-rgb mentioned this issue Feb 17, 2024

R10.4.1 Nanopore preset #1156

Closed

lh3 added a commit that referenced this issue Mar 10, 2024

r1183: added lr:hq; fixed transition

8140259

* Added the lr:hq preset suggested by Nanopore developers (#1127) * Fixed transition scoring. It did not work with presets. * Cleaned up preset documentation

lh3 closed this as completed Mar 12, 2024

EricKutschera mentioned this issue Aug 21, 2024

collapsed isoforms Xinglab/espresso#71

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Map non noisy ONT #1127

Map non noisy ONT #1127

Axze-rgb commented Nov 8, 2023

Axze-rgb commented Nov 10, 2023

iiSeymour commented Nov 10, 2023

lh3 commented Nov 10, 2023 •

edited

Loading

lh3 commented Nov 10, 2023

Checunmily commented Jan 22, 2024

iiSeymour commented Mar 4, 2024

Mon3trK commented Mar 5, 2024 •

edited

Loading

lh3 commented Mar 5, 2024

iiSeymour commented Mar 5, 2024

lh3 commented Mar 12, 2024

bepoli commented Mar 13, 2024

lh3 commented Mar 13, 2024

lh3 commented Mar 13, 2024

dolittle007 commented Mar 14, 2024

jelber2 commented Mar 14, 2024

FatYuanBao commented Mar 21, 2024

lh3 commented Mar 21, 2024 •

edited

Loading

dolittle007 commented Mar 22, 2024

camillaugolini-iit commented Oct 1, 2024

camillaugolini-iit commented Oct 1, 2024

dolittle007 commented Oct 2, 2024

Map non noisy ONT #1127

Map non noisy ONT #1127

Comments

Axze-rgb commented Nov 8, 2023

Axze-rgb commented Nov 10, 2023

iiSeymour commented Nov 10, 2023

lh3 commented Nov 10, 2023 • edited Loading

lh3 commented Nov 10, 2023

Checunmily commented Jan 22, 2024

iiSeymour commented Mar 4, 2024

Mon3trK commented Mar 5, 2024 • edited Loading

lh3 commented Mar 5, 2024

iiSeymour commented Mar 5, 2024

lh3 commented Mar 12, 2024

bepoli commented Mar 13, 2024

lh3 commented Mar 13, 2024

lh3 commented Mar 13, 2024

dolittle007 commented Mar 14, 2024

jelber2 commented Mar 14, 2024

FatYuanBao commented Mar 21, 2024

lh3 commented Mar 21, 2024 • edited Loading

dolittle007 commented Mar 22, 2024

camillaugolini-iit commented Oct 1, 2024

camillaugolini-iit commented Oct 1, 2024

dolittle007 commented Oct 2, 2024

lh3 commented Nov 10, 2023 •

edited

Loading

Mon3trK commented Mar 5, 2024 •

edited

Loading

lh3 commented Mar 21, 2024 •

edited

Loading