
FASTA Format

Miguel Amezola edited this page Sep 24, 2016 · 6 revisions

A sequence in FASTA format begins with a single-line description, followed by one or more lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that every line of text be shorter than 80 characters.
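The layout described above can be sketched in Python. This is a minimal illustration, not a reference implementation: the function names `parse_fasta` and `format_fasta` are hypothetical, and the default wrap width of 60 characters is an assumption chosen only because it stays under the recommended 80.

```python
from typing import Iterator, List, Optional, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (description, sequence) pairs from FASTA-formatted text."""
    description: Optional[str] = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.rstrip()  # keep leading columns intact; ">" must be in column 1
        if not line:
            continue  # assumption: blank lines are ignored
        if line.startswith(">"):
            # A new description line closes the previous record, if any.
            if description is not None:
                yield description, "".join(chunks)
            description = line[1:].strip()
            chunks = []
        else:
            if description is None:
                raise ValueError("sequence data found before a description line")
            chunks.append(line)
    if description is not None:
        yield description, "".join(chunks)


def format_fasta(description: str, sequence: str, width: int = 60) -> str:
    """Return one FASTA record with the sequence wrapped at `width` characters."""
    lines = [">" + description]
    lines.extend(sequence[i:i + width] for i in range(0, len(sequence), width))
    return "\n".join(lines) + "\n"
```

For example, `list(parse_fasta(">seq1 example\nMKT\nAYI\n"))` gives one record whose sequence lines have been joined into a single string, and `format_fasta` performs the reverse step when writing a record back out.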

The accepted amino acid codes are:

A  ALA  alanine
B  ASX  aspartate or asparagine
C  CYS  cysteine
D  ASP  aspartate
E  GLU  glutamate
F  PHE  phenylalanine
G  GLY  glycine
H  HIS  histidine
I  ILE  isoleucine
K  LYS  lysine
L  LEU  leucine
M  MET  methionine
N  ASN  asparagine
P  PRO  proline
Q  GLN  glutamine
R  ARG  arginine
S  SER  serine
T  THR  threonine
U       selenocysteine
V  VAL  valine
W  TRP  tryptophan
Y  TYR  tyrosine
Z  GLX  glutamate or glutamine
X       any
*       translation stop
-       gap of indeterminate length
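A sequence can be checked against these codes with a few lines of Python. This is a hedged sketch: the names `ACCEPTED_CODES` and `invalid_codes` are hypothetical, and treating lower-case letters as equivalent to upper-case is an assumption that this page does not state.

```python
# One-letter codes accepted for amino acid sequences, including
# X (any), * (translation stop) and - (gap of indeterminate length).
ACCEPTED_CODES = frozenset("ABCDEFGHIKLMNPQRSTUVWYZX*-")


def invalid_codes(sequence: str) -> list:
    """Return (index, character) pairs for characters outside the accepted set."""
    return [
        (index, char)
        for index, char in enumerate(sequence)
        # Assumption: case is ignored when comparing against the accepted codes.
        if char.upper() not in ACCEPTED_CODES
    ]
```

An empty result means every character is an accepted code; otherwise each pair gives the zero-based position and the offending character, which makes it easy to report where a record needs fixing.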
