KinomePro-DL

The Development and Application of KinomePro-DL: A Deep Learning Based Online Small Molecule Kinome Selectivity Profiling Prediction Platform

Here, we report a deep learning classification model trained on an in-house data set of inhibitors against 191 representative kinases, constructed with a novel strategy that systematically cleans and integrates six public data sets. This model, a multitask deep neural network, predicts the kinome selectivity profiles of compounds with novel structures. The model demonstrates excellent predictive performance, with auROC, prc-AUC, Accuracy, and Binary_cross_entropy of 0.95, 0.92, 0.90, and 0.37, respectively. It also performs well in a priori testing for inhibitors targeting different categories of proteins from internal compound collections, significantly improving over similar models on data sets from practical application scenarios. When the model was integrated into a subsequent machine learning-enhanced virtual screening workflow, novel CDK2 kinase inhibitors with potent kinase inhibitory activity and excellent kinome selectivity profiles were successfully identified. Additionally, we developed a free online web server, KinomePro-DL, to predict the kinome selectivity profiles and kinome-wide polypharmacology effects of small molecules (available on [kinomepro-dl.pharmablock.com](https://kinomepro-dl.pharmablock.com)). Uniquely, our model allows users to quickly fine-tune it with their own training data sets, enhancing both prediction accuracy and robustness.

Cite Us

If you find this tool useful, please cite our paper:

Wei Ma, Jiaqi Hu, Zhuangzhi Chen, Yaoqin Ai, Yihang Zhang, Keke Dong, Xiangfei Meng, and Liu Liu
Journal of Chemical Information and Modeling 2024 64 (19), 7273-7290
DOI: 10.1021/acs.jcim.4c00595

Introduction to the use of the KinomePro-DL model

1.Overview

Descriptor

Description of the model: predicts the kinase activity profile of a compound.

1.Kinase profile: the more red points on the map, the poorer the compound's potential selectivity; the fewer points on the map, the better its potential selectivity.

2.Odds diagram: indicates whether the compound may be selective for a kinase group. The greater the odds value, the greater the selectivity for that kinase group, and vice versa.

3.Prediction result file: values range from 0 to 1. The greater the value, the greater the potential activity against the target, and vice versa.

4.S_score: values range from 0 to 1. It is the number of targets predicted to be active divided by the total number of targets in the kinase profile (191). The greater the value, the worse the selectivity across the kinase profile, and vice versa.
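The S_score definition above can be illustrated with a minimal sketch. The probability cutoff used to call a target "active" is not stated in this README, so the `active_cutoff` value of 0.5 below is an assumption, and the function name and kinase labels are hypothetical.

```python
from typing import Mapping

N_KINASES = 191  # size of the kinase panel stated in this README


def s_score(probabilities: Mapping[str, float],
            active_cutoff: float = 0.5,
            n_targets: int = N_KINASES) -> float:
    """Fraction of panel kinases predicted to be active.

    active_cutoff is an assumed value; the README does not specify it.
    """
    n_active = sum(1 for p in probabilities.values() if p >= active_cutoff)
    return n_active / n_targets


# Toy profile: 3 of 191 hypothetical targets exceed the cutoff.
preds = {f"KINASE_{i}": 0.1 for i in range(N_KINASES)}
preds.update({"KINASE_0": 0.9, "KINASE_1": 0.7, "KINASE_2": 0.55})
score = s_score(preds)
print(round(score, 4))  # 3 / 191
```

A lower value means fewer predicted targets and therefore a more selective predicted profile.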

2.How to use KinomePro-DL

KinomePro-DL provides three methods of molecule submission:

1). Submit a single molecule on the web server by entering a SMILES string;

2). Submit a single molecule on the web server by drawing its structure;

3). Submit a batch of compounds on the web server by uploading a .csv file in the following format:

The first column is named 'smiles' and records the SMILES strings of the compounds. The second column is named 'comp_id' and records the names of the compounds.
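As a rough illustration of the batch file layout, the sketch below builds such a CSV with Python's standard `csv` module. It assumes the second header is written `comp_id` without spaces; the function name and the two example compounds are hypothetical.

```python
import csv
import io


def build_batch_csv(compounds):
    """Return CSV text for an iterable of (smiles, comp_id) pairs."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    # Header names follow the format described above (assumed spelling).
    writer.writerow(["smiles", "comp_id"])
    for smiles, comp_id in compounds:
        writer.writerow([smiles, comp_id])
    return buffer.getvalue()


example = [
    ("CC(=O)Oc1ccccc1C(=O)O", "aspirin"),
    ("c1ccccc1", "benzene"),
]
csv_text = build_batch_csv(example)
print(csv_text)
```

Writing the text to a file with a `.csv` extension gives a file that could then be uploaded in batch mode.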

Method 1

You can paste a single SMILES string into the text box, then click the Submit button to submit the calculation.

Method 2

You can draw a molecule using the JSME editor, then click the Submit button to submit the calculation.

Method 3

You can upload a CSV file containing a batch of molecules, set the S_score threshold (default: 0.1), and click the Submit button to submit the calculation.

3.Prediction Result

In the single molecule submission mode, five figures are displayed on the result page after the calculation.

Structure

Shows the structure of the submitted molecule;

Kinase Profiling

Shows the kinase profile of the submitted molecule as predicted by the model. The more red points on the map, the poorer the compound's potential selectivity; the fewer points on the map, the better its potential selectivity.

Odds

Shows the odds for the predicted compound, which suggest its kinase family selectivity. The compound may be selective for a class of kinases: the greater the odds value, the greater the selectivity for that kinase class, and vice versa.

Tsne

Shows a 2D dimensionality-reduction plot of the submitted compound together with compounds of known kinase activity;

Similarity_Result

Shows the reference structures of the top 50 known kinase-active compounds with the highest similarity to the submitted molecule;

4.Result Download

After the calculation, users can click on the "Download result" button in the lower right corner to download the result file.

The downloaded compressed package contains two CSV files. The file named after the date records the model's predictions for the submitted compound: the first column is the name of the kinase target, and the second column is the predicted probability of activity against that target. S_score (1 uM) records the overall kinase profile selectivity predicted by the model for the compound.

The second file, ref_mols_result.csv, records activity information for known kinase-active compounds whose similarity to the submitted compound is greater than 0.5. This file may be empty when no compound with a similarity greater than 0.5 is found.
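Once downloaded, the prediction file can be post-processed locally. The sketch below ranks kinases by predicted probability, assuming a two-column layout (target name, probability) as described above. The header names, the sample rows, and the way the S_score entry is stored are assumptions made for illustration; the real file may differ.

```python
import csv
import io


def rank_kinases(csv_text, top_n=5):
    """Return the top_n (kinase, probability) pairs, highest first."""
    ranked = []
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) < 2 or row[0].startswith("S_score"):
            continue  # skip short rows and an assumed S_score summary row
        try:
            probability = float(row[1])
        except ValueError:
            continue  # skip a header or any other non-numeric row
        ranked.append((row[0], probability))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked[:top_n]


# Hypothetical file content, for illustration only.
sample = (
    "target,probability\n"
    "CDK2,0.91\n"
    "EGFR,0.12\n"
    "AURKA,0.47\n"
    "S_score (1 uM),0.01\n"
)
top = rank_kinases(sample, top_n=2)
print(top)
```

In practice the text would come from reading the date-named CSV file in the downloaded package.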

5.Fine-Tuning & Finetuned_Prediction

The procedure for fine-tuning the model involves first uploading the prepared training data (refer to Example for the required format), then clicking the Submit button to initiate the training process, and finally downloading the updated model parameters once the training has completed.

The procedure for making predictions using the fine-tuned model involves first uploading the molecules to be predicted (refer to Example for the required format), then uploading the compressed model parameter file (.tar.gz) downloaded after fine-tuning, and finally clicking the Submit button to initiate the prediction process.

6.Miscellaneous

Time consumption of a job

The calculation process (including activity calculation and graph generation) takes about five minutes. Because computing resources are limited, submitted jobs may be placed in a queue before running, so some patience may be needed when many jobs are waiting.

Data storage time

KinomePro-DL stores job records for a maximum of 7 days, so we strongly recommend that users download all results and figures promptly.
