Identifying multi-functional bioactive peptide functions using multi-label deep learning
Motivation:
Bioactive peptides have a wide range of functions, such as lowering blood glucose levels and reducing inflammation. Meanwhile, computational methods such as machine learning are becoming increasingly important for predicting peptide functions. Most previous studies concentrate on predicting single-functional bioactive peptides. However, the number of known multi-functional peptides is increasing, so novel computational methods are needed.
Results:
In this study, we develop MLBP (Multi-Label deep learning approach for determining the multi-functionalities of Bioactive Peptides), a method that simultaneously predicts multiple functions: anti-cancer, anti-diabetic, anti-hypertensive, anti-inflammatory and anti-microbial. The MLBP model takes the peptide sequence vector as input, replacing the biological and physicochemical features used in other peptide predictors. An embedding layer learns a dense continuous feature vector from the sequence vector. A convolutional neural network layer then extracts convolution features from this feature vector, and these are combined with a bidirectional gated recurrent unit layer to improve prediction performance. In 5-fold cross-validation on the training dataset, Accuracy and Absolute true are 0.695 and 0.685, respectively. On the test dataset, Accuracy and Absolute true of MLBP are 0.709 and 0.697, which are 5.0% and 4.7% higher than those of the second-best method, respectively. The results indicate that MLBP has superior prediction performance for identifying multi-functional peptides.
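The two metrics reported above can be illustrated with a minimal sketch. It assumes the definitions commonly used in multi-label prediction: Accuracy is the mean ratio of shared labels to the union of true and predicted labels, and Absolute true is the fraction of samples whose predicted label set matches the true set exactly. The function names and the toy labels are hypothetical and are not taken from evaluation.py.

```python
def accuracy(true_sets, pred_sets):
    """Mean of |Y ∩ Z| / |Y ∪ Z| over all samples (assumed definition)."""
    scores = []
    for y, z in zip(true_sets, pred_sets):
        union = y | z
        # Treating two empty label sets as a perfect match is an assumption.
        scores.append(len(y & z) / len(union) if union else 1.0)
    return sum(scores) / len(scores)


def absolute_true(true_sets, pred_sets):
    """Fraction of samples whose predicted label set is exactly correct."""
    exact = sum(1 for y, z in zip(true_sets, pred_sets) if y == z)
    return exact / len(true_sets)


# Toy example with made-up labels for four peptides.
y_true = [{"ACP"}, {"ACP", "AMP"}, {"AHP"}, {"AIP", "ADP"}]
y_pred = [{"ACP"}, {"ACP"}, {"AHP", "AIP"}, {"AIP", "ADP"}]

print(accuracy(y_true, y_pred))       # 0.75
print(absolute_true(y_true, y_pred))  # 0.5
```

Absolute true is the stricter of the two: a partially correct prediction still contributes to Accuracy but counts as a miss for Absolute true.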
Note:
On the benchmark dataset, apart from AMPs, which include peptides with length > 45, the vast majority of ACPs, ADPs, AHPs and AIPs have length < 45. Thus, for peptides with length ≤ 45, the prediction results of MLBP are accurate. For peptides with length > 45, the prediction results for ACP, ADP, AHP and AIP are less accurate.
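One way to act on the note above is to check sequence lengths before interpreting predictions. The following is a small sketch, not part of the MLBP code: the function name, the constant names, and the example sequences are all hypothetical; only the threshold of 45 comes from the note.

```python
MAX_RELIABLE_LENGTH = 45  # threshold taken from the note above
LENGTH_SENSITIVE_LABELS = ("ACP", "ADP", "AHP", "AIP")


def flag_long_peptides(peptides):
    """Return the names of peptides longer than MAX_RELIABLE_LENGTH.

    `peptides` is a list of (name, sequence) tuples.
    """
    return [name for name, seq in peptides if len(seq) > MAX_RELIABLE_LENGTH]


# Made-up sequences used purely for illustration.
peptides = [
    ("short_example", "GLFDIVKKVVGALGSL"),
    ("long_example", "A" * 50),
]

for name in flag_long_peptides(peptides):
    labels = ", ".join(LENGTH_SENSITIVE_LABELS)
    print(f"{name}: length > {MAX_RELIABLE_LENGTH}, treat {labels} predictions with caution")
```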
FILE NAME | DESCRIPTION |
---|---|
main.py | the main file of the MLBP predictor (includes data reading, encoding, and data partitioning) |
train.py | train model |
model.py | model construction |
test.py | test model result |
evaluation.py | evaluation metrics (for evaluating prediction results) |
data | data |
BiGRU_base | models of MLBP |
- Requirement

  OS:
  - Windows: Windows 7 or later
  - Linux: Ubuntu 16.04 LTS or later

  Python:
  - Python >= 3.6

- Download MLBP to your computer:

      git clone https://github.com/xialab-ahu/MLBP.git

- Open the directory and install the packages listed in requirement.txt with pip:

      cd MLBP
      pip install -r requirement.txt

Run the predictor on a FASTA file:

    python predictor.py --file test.fasta --out_path result

- --file: the input test file in FASTA format
- --out_path: the output path for the predicted results
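
For readers unfamiliar with the FASTA format expected by --file, the sketch below shows the standard layout: a header line starting with `>` followed by the sequence. The parser is a generic illustration, not the reader used by predictor.py, and the record names and sequences are made up. Whether predictor.py accepts sequences wrapped over several lines is not stated here, so one sequence per line is the safer assumption.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Hypothetical content of a test.fasta file.
example = """>peptide_1
ACDEFGHIKLMNPQRSTVWY
>peptide_2
KKLLKKLLKK
"""

records = parse_fasta(example)
for header, sequence in records:
    print(header, len(sequence))
```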
Please feel free to contact us if you need any help.
Tang W, Dai R, Yan W, et al. Identifying multi-functional bioactive peptide functions using multi-label deep learning. Briefings in Bioinformatics, 2021.