-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
estimate polyA doesn't find tails well with custom primer #1108
Comments
Hi @shaohuishi, I believe the issue could be that your Dorado expects cDNA reads to take the form:
But you have described this format
I believe the solution is to extend the definition of your front primer to include the additional part. [anchors]
front_primer = "AAGCAGTGGTATCAACGCAGAGTACATGGG"
rear_primer = "AAGCAGTGGTATCAACGCAGAGTAC" Kind regards, |
Hi Rich, Thank you for your quick reply, However, we tried providing the complete const bool proceed = flank_score >= threshold && std::abs(dist_v1 - dist_v2) > kMinSeparation in L59, The default threshold So, we changed this threshold To eliminate these reads, we set flank_threshold=0.8
tail_interrupt_length=2 to match the estimation of nanopore error rate at this primer length, and some biological assumptions. After this change, we found that in extreme cases, based on the calculation method of Therefore, in order to use this function more flexibly and effectively, we recommend changing the calculation logic of flank_score to this: float flank_score = 0;
const float top_flank_score_v1 = 1.f - static_cast<float>(top_v1.editDistance) / (front_primer.length())
const float bottom_flank_score_v1 = 1.f - static_cast<float>(bottom_v1.editDistance) / (rear_primer.length())
const float top_flank_score_v2 = 1.f - static_cast<float>(top_v2.editDistance) / (rear_primer.length())
const float bottom_flank_score_v2 = 1.f - static_cast<float>(bottom_v2.editDistance) / (front_primer.length())
if (fwd) {
flank_score = std::min(top_flank_score_v1, bottom_flank_score_v1);
} else {
flank_score = std::min(top_flank_score_v2, bottom_flank_score_v2);
} We also hope that Thank you for your time and suggestions, |
Thank you for your detailed examination! It appears that the cDNA polyA detection does not anticipate that the front and rear primers are so similar, and this arrangement makes distinguishing between a forward and a reverse strand much more difficult. We make ongoing efforts to improve the polyA detection, and we will investigate your suggestions as we move forward. |
Dear developers,
Recently I am trying some customized methods to complete the full-length measurement of the polya tail on nanopore.
However, I don’t know whether it is because the same primer is used at both ends of the cDNA or for other reasons. Many polya that can be recognized by basecaller cannot be counted by estimate-poly-a function.
Thank you for your time,
shaohui
My Library Diagram
5' ---- ADAPTER ---- AAGCAGTGGTATCAACGCAGAGTAC----ATGGG ---- cDNA ---- poly(A) ---- GTACTCTGCGTTGATACCACTGCTT ---- 3'
As sample
read’s signal
bam output
fd4fc9af-6ca3-49d0-9699-2af7174cd85f 4 * 0 0 * * 0 0 TGTGTACGTACTTCGTTCAGCGTATTGCTGCGCACGCACTACAGAAAGCAGTGGTATCAACGCAGAGTACATGGGCTTGTTCTGGGGACATTTCGCGATTGCGGACGTTGAGAGGCCGCTGCCAAAATGCCAGAACGAGATAGTGAGCCCTTCTCTAACCCTTTGGCTCCAGATGGCCACGATGTGGATGATCCTCATTCCTTCCACCAATCAAAACTTACCAATGAAGACTTCAGGAAACTTCTTATGACCCCAAGAGCTGCACCTACTTCTGCGCCACCTTCTAAGTCACGTCACCATGAGATGCCAAGGGAGTACAATGAGGATGAAGACCCAGCTGCACGAAGGAGGAAAAAGAAAAGTTATTATGCCAAGCTTCGCCAGCAAGAAATTGAGAGAGAGAGAACTCGCAGAGAAATACCGGGACCGTGCCAAGGAACGGAGAGATGGTGTGAACAAAGACTATGAGGAAACTGAGCTGATAAGTACCACAGCCAACTACAGGGCTGTGGGCCCCACTGCTGAGGCGGACAAATCAGCAGCAGAGAAGAGAAGACAGTTGATTCAGGAGTCCAAATTCTTGGGTGGTGATATGGAACACGCCCATTTGGTGAAAGGTTTGGATTTTGCGTTGCTTCAAAAGGTGCGCGCTGAGATTGCCAGCAAAGAGAAGGAGGAAGAGGAACTCATGGAAAAGCCCCAAAAGGAAACCAAGAAAGATGAGGATCCTGAGAACAAAATTGAATTTAAAACACGCCTTGGCCGGAATGTGTATCGGATGCTTTTCAAGAGTAAATCATATGAGCGAAATGAGCTGTTCTTACCAGGACGTATGGCCTATGTAGTAGACCTGGATGATGAGTACGCAGACACAGATATCCCCACCACTCTCATACGCAGCAAAGCTGATTGCCCCACTATGGAGGCCCAGACTACACTGACTACAAATGACATTGTTATTAGCAAGCTCACCCAGATTTTGTCATACCTGAGGCAGGGGACCCGAAACAAGAAGCTCAAGAAGAAGGATAAAGGAAAACTGGAAGAGAAGAAACCTCCTGAGGCTGACATGAACATTTTTGAAGACATTGGGGATTACGTTCCTTCTACAACCAAGACACCTCGGGACAAGGAACGTGAGAGATACCGGGAACGTGAACGTGATCGGGAACGGGACAGAGACAGGGAGCGAGACAGGGAGCGAGACCGTGAGAGAGAGGGAGAGAGAGCGAGACCGGGAACGGGAACGAGGAGGAAAAGAAAAGGCACAGCTACTTTGAGAAGCCAAAAGTGGATGATGAGCCCATGGATGTTGACAAAGGACCTGGATCTGCAAAAGAGTTGATCAAGTCCATCAATGAAAAATTCGCTGGGTCTGCTGGCTGGGAAGGCACTGAATCGTTGAAGAAGCCAGAAGATAAGAAGCAGCTGGGCGATTTCTTTGGCATGTCCAACAGTTACGCAGAATGCTATCCAGCCACGATGGATGACATGGCTGTAGATAGTGATGAAGAGGTAGATTATAGCAAAATGGACCAGGGTAACAAGAAGGGTCCCTTAGGCCGCTGGGACTTCGATACTCAGGAGGAAACAGCGAGTACATGAACAACAAGGAGGCTCTGCCCAAGGCTGCATTCCAGTATGGCATCAAGATGTCTGAAGGACGGAAAACCAGACGATTCAAAGAAACCAATGATAAGGCAGAGCTTGATCGACAGTGGAAGAAAATAAGTGCAATCATTGAGAAGAGGAAGAGGATGGAAGCAGATGGGGTCGAAGTGAAAAGACCAAAGTACTAATCTCTAGTTCCAGCTGTCACCACGTGGCTGTTCTTAGTTGCTTGCTTCTACAATTCCTCAGACGGTTGCAAACTGTTGTTGTTTGTGAAAGTTTATAAATGTTTATTGTATAACTCTTTATAGATCTGTGTCCCACATGCTAAGATTAATGGCAATGCAACACCATGTCCAGCATGTTCCATTAAATGTAGTTAAACCTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACTCTGCGTTGATACCACTGCTTTCTGTAGTGCGTGCGCAGCAATA %&&&'&&%%&'''''(*)++&&&'7888HSSNSQSSSSSSIHSSLKPSSSKSMSSLSSNSSNS::::S:::::NOMLRC@;.-,,,FKLSSMSRSSSKSSSSNSSSKSSSSSSSSLISLSKD@FLMKJMLISSLSSLOSSSPLISRSSSSSSSSSSOSSJNSSSSOIISSGHIISMJSSSLKLPIIJIMMSSSLHKKRSDHMGHHHIIPSSSSSSSSSSPIS::::96=;<7/.(()3DMLSSB@@@@JLOMNSSMMSSSG@@>>10001ABCEFDDCFDJI9333DEJRNSSSOSLSSSMSSSSSSSSSSSSSSMPP===MSSSNKSSSSMSSSMSSSNSSS<;713:;C===FHILQSSSSSSMJSSRLKRSSSSSQSSSSSPJODSSSSSG>=<?3B8HSNSSSSQSSLSJFSSSSMQHGGKNSSPSSSSSSSSSSSOSSSSSSSSSSQSQSSSOSSSSSSSSOSSSSSSSSSSSSSSSSSSSSNSSSSSSSSNSSSSSQSSSSSSSRRLOJLFAAA@?LJLOSSNPNKSSSSMSMKNSSSKGGNJOIHSSLOOJLSSJEFF;;HIHSSOJKLSSSOSSSSSSSSSSQSSSPSSSSLSSSQSNSSSSSSSSSSSSSSSKONJSSSSSSSSSQMMSSSSSSSSNSMSSQNLSGSMMKLSKSPOSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSJSSSSSSSSSSPLRSSSSSSKJJSSSOSSKSSSSSSSSSSSKQSMJKKNSSSSSIOSSSQSSSLRSSSSSLSSSSMNSSNMNRMEDIFFSSSOSSSSSSSSSSMISSSSSSSSSSSSSSSSQSSSSSSSSSKSSSSSSSSOSQSSRSSSSSNSSSSSSSSSSQSSSOSNJSSSSLHKSSSSSSSSSSSQSPSSSSSSOSSSSSIMSNSSSSSSSSSSSJISSSSSSSSSGGGHSSSSSSSSSSSHKSSQSSSLIKLHQMJSSSNSSSSSSSSSSSSSSSSNSSSSKSHSSMLJGSSHSHJHJJILSSSMSSSSSSSRSSSSSSPSJHHIFFFSSSFFGFIGGFHHFSLILSSSSOSJSSNSSSSSSKHQSHOKSKKMPSSSSLLJGJ6334CDB>:43100./3:;<:96666GKOSSFEA@ABASKSHHGGHKNKHOKLHSSSPFOJSPOMIOMSMIK>JSMMHGFJFNSGSIISFCCECHHAIEHJPKSILSF>HDGSSIFFDD7333463429;CA?:9?HE@@@?9998<=IIPSSNSHSKIBB@;@GFOMHKMSSSSSSSSSSMSSSPNMMIOSSSSSSSOMSSSSNSSSSSSSSSPOMJSSSSSPSSQSSLMMSSSSSSSSSSQSMSSSLHIISDBCDDJPSSSQSQSSSSSSSSSSSSSJMSSSSSSRSQSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSRSSSSSSSSSSSSSRSSSSSSSNSSSSSSSHGHSQSQSLSSSSNSLISSSSSPSSSSSSSSSSOSSSOSSSSSSSSSSSPSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSILSSSSSSSSSPMMKSSSSQNSSSSSSSSSOSSSSSSSSSCCCDDSSSMJSSSSSSLMMLSSSSSSSSKSSJFGFHSNSSSNKSSSSMISSSSSSSSSSSSSSSSSSSSNSSSSIHD42436888999:==><>=GSSSSSMSSSSSSSRSSSSSQMSHHKSSSSSSPSSSSSSSSSSSSSJMSSMJISEIE1000007@?CCIMKNSSSSNSSSSPMPSSPMSSSSSSSSSSSSNSSSSSSSSSNQSSSNJHHGIILSSSJSKLSSSSSRSSSSSNSSSSSNLSSOSSSQSSSSSSSLSSSSSNLISSSSSSSOSSSMPSSSSOSRSSSOSSSSSSSNSSSNSRSSSSSSSSSSSSSSSSSSSSSSSPSSSSSSIMSSOSSSSOSSSSSSSSSSSSSSSSSSSSSSSSNSSSSSSSSSSSSJMSSSSLSSSSSSSSSSSSSSSSSLIQLAHSLHMSSSSSSSSQSMJSSSSSS6122374555556667789;=?@BDCGGNKISSPHIHIIHHJJCEDEFSSSFGFHGSSSSSSSSSKGFGHGFJIHIFDCDCBFGIG::92 qs:f:29.0782 du:f:5.587 ns:i:27935 ts:i:10 mx:i:1 ch:i:174 st:Z:2024-08-20T13:54:18.740+00:00 rn:i:22003 fn:Z:PAY63012_pass_2d2e6b65_9300ab27_1974.pod5 sm:f:720.842 sd:f:125.658 sv:Z:pa dx:i:0 RG:Z:9300ab2755c6080067ea07aae5b884a17ba2bb21_dna_r10.4.1_e8.2_400bps_sup@v5.0.0
poly-a-config
Run environment:
The text was updated successfully, but these errors were encountered: