-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No Splicing Events written in any output file,although in stdout I can see many that were counted. #89
Comments
If you are able to use rmats version 4.1.1 then the output will contain a section that starts with "read outcome totals across all BAMs". That section shows how many of the input reads were not able to be used along with the reason why. That output should help identify the reason for the empty output files |
Thanks Eric ! I launched 4.1.1, and found the following information in stdout:
I have results of Splicing events now, because I corrected readLength =101 from =100(!!!), when I saw NOT_EXPECTED_READ_LENGTH: 458607844 ! Can you tell me though, what the other parameters mean, and if there is something else bad in my bams that I can take care of ? |
NOT_NH_1
And from the STAR manual: https://raw.githubusercontent.com/alexdobin/STAR/master/doc/STARmanual.pdf
NOT_EXPECTED_CIGAR EXON_NOT_MATCHED_TO_ANNOTATION JUNCTION_NOT_MATCHED_TO_ANNOTATION |
Thanks so much for the explanations. I ll see about the non-annotated exons,if we are interested. Very nice improvement of the latest version of rMATS anyway ! |
rMATS turbo v4.1.1 is available on bioconda: https://anaconda.org/bioconda/rmats |
From the read outcomes about 20% of the reads are CLIPPED. You can use those reads by passing --allow-clipping to rmats About 15% of the reads are NOT_PAIRED which means the aligner did not mark them as properly paired The main issue seems to be EXON_NOT_MATCHED_TO_ANNOTATION and JUNCTION_NOT_MATCHED_TO_ANNOTATION which together are about 50% of the reads. Those outcomes happen when the read passes the other filters but rmats is unable to match up the alignment to the annotated exons and splice sites. That could happen if the --gtf given to rmats doesn't agree with the reference files used to align the reads. Another potential cause of NOT_MATCHED_TO_ANNOTATION is that the reads just don't align well with the reference. Since you are using a non-model species maybe the reference files don't match the actual exons in your data |
Hi EricKutschera, thanks for your reply, yesterday I was ging trough the documentation and I added the --allow-clipping to the code, nothing changed, it continues to give : Done processing each gene from dictionary to compile AS events but when I check the result files, I only have headers again. The thing is I mapped my paired-end G. gerbillus RNAseq samples to unguiculatus with HISAT2, and even though the alignment was like of 50% something almost all the proteins in the used gff at the time were mapped (23000 something). the thing is, we actually know there is splicing events here, because I have in parallel a colaborator that used star and got a higher mapping and found splicing events in the exact same samples with the exact same species... Thanks why I think its very strange because if the results are shown and say that there are splicing evens, shouldn´t they be saved in the output text files ? Thanks again for your kind reply, help is much appreciated |
rmats can detect events using information from the GTF and from the bams. There are different files that report the events detected based on different information as described in the README: https://github.com/Xinglab/rmats-turbo/tree/v4.1.2#output
If the output has |
Hi . I also used HISAT2 and the same with you ! |
gtf: 175.58477663993835 read outcome totals across all BAMs |
The output shows a lot of used reads |
I have rMATS verison 4.3.0, and I am also getting similar issues, where rMATS seem to show results and not show any errors, yet the output files, esp. JC and JCEC ones, are empty. Does it have to be version 4.1.1 specifically? |
4.3.0 also shows the read outcome totals. They are printed to stdout and also written to the |
@EricKutschera They do produce those. But still, when I ran rMATS through conda run rmats.py, using my USCS-formatted BAM files, only the "fromGTF.[AS event].txt" file produced information, while the rest: "fromGTF.novelSpliceSite.[AS event].txt," "JC.raw.input.[AS event}.txt," "JCEC.raw.input.[AS event].txt," "[AS event].MATS.JC.txt" and "[AS event].MATS.JCEC.txt" files only produced the header while the rest is blank. I was asking if I should lower the rMATS version to 4.1.1, or is 4.3.0 shouldn't be making any of these issues? I'll attach some files here to show you what I meant (all the rMATS files as .txt, and RI-focused results just to not overload them, and since that one's specific to our research): |
From the read outcomes file, only about 2,000 reads were USED out of about 800 million. About 65% were filtered for If your reads are a fixed length you can double check the value of |
Hello,
I find it really weird to obtain empty files in results folder (A3SS.MATS.JC/JCEC, etc.)
while no Error is poping up, and I am seeing in my slurm.out the following :
Only the " fromGTF." files per splicing type are not empty.
The others have only headers without any gene record.
Input are BAM files from STAR alignment, gtf is given same as one for STAR, there are 3 replicates per condition, and I run in cluster with 16 cpus-per-task.
My command :
Could sb give me any hints on what may be happening and how to find out ?
The text was updated successfully, but these errors were encountered: