Skip to content

FASTA Format

Miguel Amezola edited this page Sep 24, 2016 · 6 revisions

The following information is from http://zhanglab.ccmb.med.umich.edu/FASTA/

"A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters in length."

The accepted amino acid codes are:

Code Amino acid
A ALA alanine
B ASX aspartate or asparagine
C CYS cystine
D ASP aspartate
E GLU glutamate
F PHE phenylalanine
G GLY glycine
H HIS histidine
I ILE isoleucine
K LYS lysine
L LEU leucine
M MET methionine
N ASN asparagine
P PRO proline
Q GLN glutamine
R ARG arginine
S SER serine
T THR threonine
U selenocysteine
V VAL valine
W TRP tryptophan
Y TYR tyrosine
Z GLX glutamate or glutamine
X any
* translation stop
- gap of indeterminate length
Clone this wiki locally