-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Happy EDTA users with successful cases #15
Comments
Thanks for developing EDTA, and this is for your reference. Fish genome ~480 Mb, EDTA.pl (commit 82b16f6) job finished in 7h.
Cheers, |
Hi Finally, my rerun without splitting EDTA_raw.pl into subsets finished too :) It's an amphidiploid plant genome, about 1GB in size, run with 15 threads, commit 757a96d (the one where MITEHunter is turned off) It ran for 242.8 hours (10.1 days) if run sequentially (so no splitting up of EDTA_raw.pl steps) Here's the whole log so you can see which steps took how long: Wed Sep 4 21:07:28 AWST 2019 EDTA_raw: Check files and dependencies, prepare working directories. Wed Sep 4 21:07:28 AWST 2019 Start to find LTR candidates. Wed Sep 4 21:07:28 AWST 2019 Identify LTR retrotransposon candidates from scratch. Tue Sep 10 00:34:18 AWST 2019 Finish finding LTR candidates. Tue Sep 10 00:34:18 AWST 2019 Start to find TIR candidates. Tue Sep 10 00:34:18 AWST 2019 Identify TIR candidates from scratch. Species: others rm: cannot remove `./TIR-Learner/*': No such file or directory Finish finding TIR candidates. Fri Sep 13 09:13:15 AWST 2019 Start to find MITE candidates. Fri Sep 13 09:13:15 AWST 2019 Identify MITE candidates from scratch. Fri Sep 13 09:13:15 AWST 2019 Warning: Because MITE-Hunter is too slow and only contribute limited new TIR candidates, it is taken down temporary until a better solution is found. Fri Sep 13 09:13:15 AWST 2019 Finish finding MITE candidates. Fri Sep 13 09:13:15 AWST 2019 Start to find Helitron candidates. Fri Sep 13 09:13:15 AWST 2019 Identify Helitron candidates from scratch. Fri Sep 13 23:32:36 AWST 2019 Finish finding Helitron candidates. Fri Sep 13 23:32:36 AWST 2019 Execution of EDTA_raw.pl is finished! Sun Sep 15 07:17:49 AWST 2019 EDTA basic and advcanced filters finished. Sun Sep 15 07:17:49 AWST 2019 Perform EDTA final steps to generate a non-redundant comprehensive TE library: Skip the RepeatModeler step (-sensitive 0). Run EDTA.pl -step final -sensitive 1 if you want to use RepeatModeler. Sun Sep 15 14:01:06 AWST 2019 EDTA final stage finished! Check out the final EDTA TE library: ragoo.fasta.EDTA.TElib.fa Sizes:
The final TElib has 22,921 sequences. Running a final RepeatMasker run with the final TElib gives me about 75% of the genome being repeats, which makes me very happy :) I should check how many of these repeats are known within Dfam and Repbase, my guess is not that many as a previous run with an older Repbase version gave me a much lower count of repeats MUCH LATER EDIT: I've now rerun the final stage with Mon Sep 30 17:04:54 AWST 2019 Perform EDTA final steps to generate a non-redundant comprehensive TE library: Use RepeatModeler to identify any remaining TEs that are missed by structure-based methods. Sat Oct 5 14:51:32 AWST 2019 EDTA final stage finished! Check out the final EDTA TE library: ragoo.fasta.EDTA.TElib.fa So yeah, adding this step to my pipeline added 5 days of runtime! What this did is add 4,186 repeats of type 'unknown' to my TElib.fa, but these are all tiny (the total filesize increased from 46MB to 47MB). @oushujun is this a problem on my end? Should RepeatModeler have assigned classes to those repeats? |
(EDTA) v1.5 |
Thank you so much for making this awesome tool! It makes TA annotation so much easier and I will for sure use it for my next genomes. I tried it on a bird (~1.2Gb) with
(It probably would have been faster if I had given more threads to the TIR search instead of splitting equally between the three searches!) Thank you, |
I recently finished one plant genome about 1.8 Gb.
|
Thank you for the awesome tool. Successfully finished three plant genomes (~3-3.5 Gb):
Raw steps run with EDTA v1.7.8 from bioconda, the rest with EDTA v1.8.5. All run with 48 threads (Intel Platinum 8160F Skylake @ 2.1Ghz, RAM=192000M), --sensitive 0 in the FINAL step and CDS provided. Cheers, |
Hi, Thanks for developing such a cool software,I have successfully annotated the TEs of an insect genome. But I want to known if the library constructed by EDTA could be used as the consensus sequences for calculating the age of individual TEs? Thank you, |
Dear Yang Yi,
Thanks for using EDTA. Unfortunately it does not generate consensus
sequences. Those in the library are nonredundant sequences taken directly
from TEs.
Best,
Shujun
…On Tue, Jul 14, 2020 at 9:30 AM yangyi09 ***@***.***> wrote:
Hi,
Thanks for developing such a cool software,I have successfully annotated
the TEs of an insect genome.
But I want to known if the library constructed by EDTA could be used as
the consensus sequences for calculating the age of individual TEs?
Thank you,
Yang yi
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#15 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABNX4NDGSB7TDS6AD765DF3R3RTWRANCNFSM4IOBNCDQ>
.
|
Thank you all for using and promoting EDTA since its early stage. As for now, the program has gained some popularity, and works using EDTA have been showing up online. This thread can be retired. Below are some studies citing EDTA: Homologous chromosomes in asexual rotifer Adineta vaga suggest automixis The transposable elements of the Drosophila serrata reference panel Chromosome-level Genome Assembly of a Regenerable Maize Inbred Line A188 |
Dear all, Shujun Thank you for the awesome tool. Successfully finished one plant (Rhododendron)genome (~850M):
command The sub-item time is not very clear (it was interrupted several times in the middle), and the total time is about 20h. Sincerely, |
Hi all,
Just update the testing result. It seems that new release TIR can close this issue.
Thanks for the developing.
Bests,
Zhigui
Originally posted by @baozg in #4 (comment)
The text was updated successfully, but these errors were encountered: