-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EDTA ended with the final file '*.mod.EDTA.TEanno.sum' empty #219
Comments
I have the similar error few day ago, the cause of empty '.mod.EDTA.TEanno.sum' file is the RepeatModeler step identifyied New TE classes that the "../util/gff2bed.pl" script program can't properly handled, and lead to New TE classes lines with incorrect formation in '.mod.EDTA.TEanno.bed' file, and finaly cause empty '*.mod.EDTA.TEanno.sum' file. you can try manully add New TE classes‘s matching regular expression type in "../util/gff2bed.pl" script, some like "$type = "DIRS" if $sequence_ontology =~ /DIRS/i" in line 51-60, and rerun EDTA with ".. --step anno --anno 1 --evaluate 1 --species others --step all --overwrite 0 --sensitive 1 --force 1". |
The latest version (v1.9.9) should have this issue resolved. - Shujun |
You may install the conda version, then clone the github to your server and
you will get the latest versio. -Shujun
…On Wed, Sep 8, 2021 at 11:13 PM hlkfoz ***@***.***> wrote:
The latest version (v1.9.9) should have this issue resolved. - Shujun
Hi shujun!
How can i download The latest version (v1.9.9) , I can't find The latest
version (v1.9.9) in github.com, the lastest release is v1.9.6.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#219 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNX4NF52RGRKAF3JWORWKTUBAYABANCNFSM5DVIMMLA>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
Thanks! I will try this one and shujun's advice after my program finish. |
Thank you, Shujun. I will try it later. |
Hi Shujun, I have tried to re-annotation using EDTA v1.9.9, but that error is still here. I used the method @hlkfoz adviced, to modify "./util/gff2bed.pl" script like this, it seems everything goes well. 51 $type = "Cent" if $sequence_ontology =~ /Cent/i; 52 $type = "knob" if $sequence_ontology =~ /knob/i; 53 $type = "LINE" if $sequence_ontology =~ /LINE|RIL/i; 54 $type = "SINE" if $sequence_ontology =~ /SINE|RIS/i; 55 $type = "nonLTR" if $sequence_ontology =~ /non_LTR/i; 56 $type = "rDNA" if $sequence_ontology =~ /rDNA/i; 57 $type = "satellite" if $sequence_ontology =~ /satellite/i; 58 $type = "low_complexity" if $sequence_ontology =~ /low_complexity/i; 59 $type = "telomere" if $sequence_ontology =~ /telomer/i; 60 $type = "subtelomere" if $sequence_ontology =~ /subtelomer/i; 61 $type = "Helitron" if $sequence_ontology =~ /Helitron|DHH/i; 62 $type = "Penelope" if $sequence_ontology =~ /Penelope|RPP/i; 63 $type = "repeat_region" if $sequence_ontology =~ /repeat_region/i; **64 $type = "DIRS" if $sequence_ontology =~ /DIRS/i; 65 #line 64 is modified according to the reference solution** 66 $type = 'repeat_region' if $sequence_ontology =~ /Unknown/i; #suggested by Changfu Jia 67 $type = $1 if $sequence_ontology =~ /^(.)/./ and $1 !~ /DNA|MITE/i;` |
@zhxh233 if you check the code of /util/gff2bed.pl on the current github (v1.9.9), it has the exact same code you posted above. I suscept you didn't use the latest version even you have downloaded it (i.e., you need to specify the path to your downloaded code). - Shujun |
######################################################## Extensive de-novo TE Annotator (EDTA) v1.9.9Shujun Ou (shujun.ou.1@gmail.com)######################################################## 2021年 09月 09日 星期四 12:26:11 CST Dependency checking: 2021年 09月 09日 星期四 12:26:14 CST Obtain raw TE libraries using various structure-based programs: 2021年 09月 09日 星期四 12:26:17 CST Start to find LTR candidates. 2021年 09月 09日 星期四 12:26:17 CST Identify LTR retrotransposon candidates from scratch. 2021年 09月 09日 星期四 12:32:25 CST Finish finding LTR candidates. 2021年 09月 09日 星期四 12:32:25 CST Start to find TIR candidates. 2021年 09月 09日 星期四 12:32:25 CST Identify TIR candidates from scratch. Species: others 2021年 09月 09日 星期四 12:59:33 CST Start to find Helitron candidates. 2021年 09月 09日 星期四 12:59:33 CST Identify Helitron candidates from scratch. 2021年 09月 09日 星期四 13:51:01 CST Finish finding Helitron candidates. 2021年 09月 09日 星期四 13:51:01 CST Execution of EDTA_raw.pl is finished! 2021年 09月 09日 星期四 13:51:01 CST Obtain raw TE libraries finished. 2021年 09月 09日 星期四 13:51:01 CST Perform EDTA advance filtering for raw TE candidates and generate the stage 1 library: 2021年 09月 09日 星期四 13:54:24 CST EDTA advance filtering finished. 2021年 09月 09日 星期四 13:54:24 CST Perform EDTA final steps to generate a non-redundant comprehensive TE library:
2021-09-09 18:40:15,186 -WARNING- Grid computing is not available because DRMAA not configured properly: Could not find drmaa library. Please specify its full path using the environment variable DRMAA_LIBRARY_PATH Penelope/NA not found in the TE_SO database, it will not be used to rename sequences in the final annotation. 2021年 09月 09日 星期四 18:44:30 CST Homology-based annotation of TEs using PhEs.test.fasta.mod.EDTA.TElib.fa from scratch. Use of uninitialized value $type in concatenation (.) or string at /home/biolinux/miniconda3/envs/EDTA/share/EDTA/util/gff2bed.pl line 85, line 485. 2021年 09月 09日 星期四 18:59:14 CST Evaluate the level of inconsistency for whole-genome TE annotation (slow step): Use of uninitialized value in pattern match (m//) at /home/biolinux/miniconda3/envs/EDTA/share/EDTA/util/call_seq_by_list.pl line 88. I think the latest gff2bed.pl script still not handled "DIRS" type properly . |
I downloaded another copy of the latest version from github on my Windows computer (which never installed or downloaded EDTA before) to check the code, in EDTA.pl, the version number is 1.9.9, while in gff2bed.pl, there is indeed no some code in the part "determine $type for struc-homo TE annotation merging" as @hlkfoz , like this: ` ` the verision of gff2bed.pl is v0.3, 07/22/2020, so maybe there is still something wrong ? |
@zhxh233 Sorry for the delayed reply. What errors were you suggesting? Did you get results with the v1.9.9 conda version? - Shujun |
Thanks for reply. After I edited the EDTA.pl script as others' suggestion, the script can run normally. My last commented just said that I found the source code of EDTA.pl on github which version is 1.9.9, it still has no the code |
Thanks for making that clear. I found the issue. The correct fix would be replacing line 43 with The fix will be updated in the next version. Thanks for helping to debug. Best, |
Thank you for your great tools ! : ) I'll try this fix later. |
Hi Shujun, Thank you for developing this excellent package. I'm getting similar errors with EDTA v2.2.1 (
It did work successfully after I skipped I cloned EDTA from github and installed the dependencies with mamba. The gff2bed.pl version is 0.6. Is there any fix for this? Best wishes, |
Hi shujun!
I am using EDTA as several weeks ago (with --anno 1 --evaluate 1 --species others --step all --overwrite 0 --sensitive 1 --force 1
), but this time the script ended with *.mod.EDTA.TEanno.sum empty. I have tried to short the sequence name and the file name or without "--force 1" parameter, as you mentioned in other issues, but it didn't work.
a) Use of uninitialized value $type in concatenation (.) or string at (GFF file) (many lines of this one)
b) Use of uninitialized value $chr_pre in hash element at /beegfs/store3/zhangxiaohang/software/EDTA-master/util/call_seq_by_list.pl line 90.
c) Use of uninitialized value $pos in pattern match (m//) at /beegfs/store3/zhangxiaohang/software/EDTA-master/util/call_seq_by_list.pl line 100.
d) Use of uninitialized value $pos in concatenation (.) or string at /beegfs/store3/zhangxiaohang/software/EDTA-master/util/call_seq_by_list.pl line 103.
e) ERROR: Can not recognize this MSU position in the list!
f) ERROR: TE annotation stats results not found in ${myfile}.mod.EDTA.TE.fa.stat!
So, how can I fixed it? Or if I need the output_gff at least, should I use the *EDTA.intact.gff3 or *EDTA.TEanno.gff3 or another file in some folders or all these files may be with some mistakes?
Thank you! : )
The text was updated successfully, but these errors were encountered: