-
Notifications
You must be signed in to change notification settings - Fork 191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
primary_transcripts.py Not working #363
Comments
Hi Pedro Sorry, that tool is for proteomes downloaded from Ensembl, which labels isoforms according to which gene locus they come from. For transcriptomes which aren't labelled like that you won't be able to use the primary_transcript.py script. If you are able to just provide input files to OrthoFinder that have just one transcript per gene that would help--for example, if you assembled the transcriptomes using Trinity then it will have the information for you to do that. Otherwise, just provide your transcriptome fasta files as input to OrthoFinder. All the best |
Hi David, Thank you for the quick answer, I didn't notice that the primary_script.py script was meant to work with Ensembl data. I do have some assembled transcriptomes and was able to use the aminoacid fasta files (.faa) in a test run, which seemed to work fine. When you say to use the transcriptome fasta files, do you mean the nucleotide fasta? It would be interesting to test it, since the assembly is the most time consuming step. Best regards, |
Hi Pedro No, it's just amino acid sequences at the moment. We are looking into using assembled nucleotide sequences but if any such feature were added, it would be quite a way in the future. All the best |
Hi David, Traceback (most recent call last): My input file is downloaded from ensembl, and it looks like:
Any suggestions will be highly appreciated! Yixin |
For anyone meets the same issue. |
Hi This issue has been resolved, see here: #345 You can just download the latest version of the script from the master branch on github and use that. I'll create a new release in the coming days that contain all the latest changes. All the best |
Hi @davidemms, I'm having a similar issue and can't fix it with any of the solutions you've mentioned. When I try to run primary trancripts on my files, it looks like it's running but nothing actually happens. Running
This is what the file looks like:
Any advice on how I can run it? |
Greetings,
I'm currently trying to find orthologs/orthogroups in a group of algae species of the same genus from a set of transcriptomes. I found Orthofinder while looking for ways to do this and it seemed to be one of the most complete tools out there, so I decided to try it out.
While following the tutorial to learn how to use it, I came across the following ror when using the primary_transcript.py script:
Traceback (most recent call last): File "/home/pedro/OrthoFinder/tools/primary_transcript.py", line 147, in <module> main(args) File "/home/pedro/OrthoFinder/tools/primary_transcript.py", line 143, in main CreatePrimaryTranscriptsFile(fn, dout) File "/home/pedro/OrthoFinder/tools/primary_transcript.py", line 63, in CreatePrimaryTranscriptsFile if not line.startswith(">"): continue
I'm new to computational work and a beginner when it comes to Python in particular, so I have no clue what the problem might be.
Thanks in advance.
The text was updated successfully, but these errors were encountered: