-
Notifications
You must be signed in to change notification settings - Fork 0
FASTA Format
A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters in length.
The accepted amino acid codes are: A ALA alanine P PRO proline B ASX aspartate or asparagine Q GLN glutamine C CYS cystine R ARG arginine D ASP aspartate S SER serine E GLU glutamate T THR threonine F PHE phenylalanine U selenocysteine G GLY glycine V VAL valine H HIS histidine W TRP tryptophan I ILE isoleucine Y TYR tyrosine K LYS lysine Z GLX glutamate or glutamine L LEU leucine X any M MET methionine * translation stop N ASN asparagine - gap of indeterminate length
-
C. E. Shannon: A mathematical theory of communication. Bell System Technical Journal, vol. 27, pp. 379–423 and 623–656, July and October 1948
-
"What Is FASTA Format?" FASTA Format. Zhang Lab, University of Michigan, n.d. Web. 24 Sept. 2016.
-
Rost, B.; J.Liu; R.Nair; K.O. Wrzeszczynski; Y. Ofran (2003). "Automatic prediction of protein function". Cellular and Molecular Life Sciences. 60 (12): 2637–2650. doi:10.1007/s00018-003-3114-8. PMID 14685688.
-
Alpaydin, Ethem. "Introduction to Machine Learning." MIT Press, 04 Dec. 2009. Web. 24 Sept. 2016.
-
Nielsen, Michael. "Neural Networks and Deep Learning." Neural Networks and Deep Learning. N.p., June 2016. Web. 24 Sept. 2016.
-
"Implementing a Neural Network from Scratch in Python – An Introduction." WildML. N.p., 10 Jan. 2016. Web. 24 Sept. 2016.
-
"Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs." WildML. N.p., 08 July 2016. Web. 24 Sept. 2016.
-
Singh, Jagjit. Great Ideas in Information Theory, Language and Cybernetics. New York: Dover Publications, 1966. Print.
-
"Structural Bioinformatics Guide: Protein Structure, Protein Sequence Analysis and Alignment, Homology Modeling." Protein Structure and Structural Bioinformatics. N.p., n.d. Web. 25 Sept. 2016.
-
Kurzynski, M.; Chelminiak, P. Stochastic Dynamics of Proteins and the Action of Biological Molecular Machines. Entropy 2014, 16, 1969-1982.
-
Strait, B J, and T G Dewey. “The Shannon Information Entropy of Protein Sequences.” Biophysical Journal 71.1 (1996): 148–155. Print.
-
Heunen, Christiaan Johan Marie., Mehrnoosh Sadrzadeh, and Edward Grefenstette. Quantum Physics and Linguistics: A Compositional, Diagrammatic Discourse. Oxford, United Kingdom: Oxford UP, 2013. Print.