-
Notifications
You must be signed in to change notification settings - Fork 0
FASTA Format
The following information is from http://zhanglab.ccmb.med.umich.edu/FASTA/
A sequence in begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column.
The accepted amino acid codes are:
Code | Amino acid |
---|---|
A | ALA alanine |
B | ASX aspartate or asparagine |
C | CYS cystine |
D | ASP aspartate |
E | GLU glutamate |
F | PHE phenylalanine |
G | GLY glycine |
H | HIS histidine |
I | ILE isoleucine |
K | LYS lysine |
L | LEU leucine |
M | MET methionine |
N | ASN asparagine |
P | PRO proline |
Q | GLN glutamine |
R | ARG arginine |
S | SER serine |
T | THR threonine |
U | selenocysteine |
V | VAL valine |
W | TRP tryptophan |
Y | TYR tyrosine |
Z | GLX glutamate or glutamine |
X | any |
- | translation stop
- | gap of indeterminate length
-
C. E. Shannon: A mathematical theory of communication. Bell System Technical Journal, vol. 27, pp. 379–423 and 623–656, July and October 1948
-
"What Is FASTA Format?" FASTA Format. Zhang Lab, University of Michigan, n.d. Web. 24 Sept. 2016.
-
Rost, B.; J.Liu; R.Nair; K.O. Wrzeszczynski; Y. Ofran (2003). "Automatic prediction of protein function". Cellular and Molecular Life Sciences. 60 (12): 2637–2650. doi:10.1007/s00018-003-3114-8. PMID 14685688.
-
Alpaydin, Ethem. "Introduction to Machine Learning." MIT Press, 04 Dec. 2009. Web. 24 Sept. 2016.
-
Nielsen, Michael. "Neural Networks and Deep Learning." Neural Networks and Deep Learning. N.p., June 2016. Web. 24 Sept. 2016.
-
"Implementing a Neural Network from Scratch in Python – An Introduction." WildML. N.p., 10 Jan. 2016. Web. 24 Sept. 2016.
-
"Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs." WildML. N.p., 08 July 2016. Web. 24 Sept. 2016.
-
Singh, Jagjit. Great Ideas in Information Theory, Language and Cybernetics. New York: Dover Publications, 1966. Print.
-
"Structural Bioinformatics Guide: Protein Structure, Protein Sequence Analysis and Alignment, Homology Modeling." Protein Structure and Structural Bioinformatics. N.p., n.d. Web. 25 Sept. 2016.
-
Kurzynski, M.; Chelminiak, P. Stochastic Dynamics of Proteins and the Action of Biological Molecular Machines. Entropy 2014, 16, 1969-1982.
-
Strait, B J, and T G Dewey. “The Shannon Information Entropy of Protein Sequences.” Biophysical Journal 71.1 (1996): 148–155. Print.
-
Heunen, Christiaan Johan Marie., Mehrnoosh Sadrzadeh, and Edward Grefenstette. Quantum Physics and Linguistics: A Compositional, Diagrammatic Discourse. Oxford, United Kingdom: Oxford UP, 2013. Print.