An open source and cross platform application to fix, and find problems in protein FASTA sequence files.
tidyFASTA is a cross platform application (Windows, Mac OS X, Linux) and is available free of charge and without registration.
Problem | Fix (Strict mode) |
---|---|
Sequence without ID | ID name added |
Multiline sequence | One line per sequence |
ID without sequence | Sequence ignored (Exception raised) |
Non canonical AA | Sequence ignored (Exception raised) |
Lowercase AA | Converts to uppercase AA |
Excessive Whitespace | Removes excessive whitespace |
- Install java (if required)
- If Java is not already installed on your computer (version 11 or greater), it is available here for all major operating systems.
- Download the jar file from the release tab.
- Run the tidyFASTA-gui file by opening the file, or from the command line.
tidyFASTA is open source and all processing is run entirely on your local machine.
tidyFASTA is also available as a python package.