Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepTox: Toxicity Prediction using Deep Learning #72

Closed
agitter opened this issue Aug 9, 2016 · 5 comments
Closed

DeepTox: Toxicity Prediction using Deep Learning #72

agitter opened this issue Aug 9, 2016 · 5 comments
Assignees
Labels

Comments

@agitter
Copy link
Collaborator

agitter commented Aug 9, 2016

http://doi.org/10.3389/fenvs.2015.00080

Related to virtual screening #45.

@agitter
Copy link
Collaborator Author

agitter commented Aug 25, 2016

Biology

Computational aspects

  • Cites graph-based kernels for molecular graphs dating back to 2003 (relevant to Convolutional Networks on Graphs for Learning Molecular Fingerprints #52 and Molecular Graph Convolutions: Moving Beyond Fingerprints #53)
  • Acknowledge some of the related deep learning methods but do not discuss them or compare to them
  • Won the grand challenge and several sub-challenges, but not the top performer for every individual assay
  • Score with AUC despite class imbalance
  • Use multi-task learning, number of tasks is comparable to (Multi-task Neural Networks for QSAR Predictions #57) but less than (Massively Multitask Networks for Drug Discovery #55)
  • Claim deep learning in general performs well with a large dataset, related input features, and multi-task setting. Tox21 satisfies these conditions.
  • Indicator variable in objective function to ignore untested compounds
  • ReLU hidden units with multi-task sigmoid output layer
  • 1024, 2048, 4096, 8192, or 16384 hidden units per layer and up to 4 hidden layers
  • Optimize hyperparameters separately for each task even though they have a multi-task network
  • Supplement the challenge training data with similar compounds and assays from PubChem, ChEMBL, etc.
  • Also use SVM, random forest, and elastic nets both as competing methods and in an ensemble with the neural network
  • Cluster-based cross validation to help prevent indirect test fold leakage when compounds are very similar
  • Multi-task network outperforms single-task for 10 of 12 assays, but Massively Multitask Networks for Drug Discovery #55 has a better assessment of the impact of the number of tasks in this domain
  • Adding chemical descriptor inputs doesn't substantially boost performance relative to fingerprint features alone
  • For interpretation, associate hidden units with toxicophores using U-test and correlation
  • They claim that the neural network is better than competing methods in most cases, but either SV or random forest is better than or at least competitive with the neural network in terms of AUC for many of the assays
  • Provide code and a cleaned dataset

Why include it in the review

  • Winning a challenge can help bolster claims that this is the state of the art in the domain
  • This is not the authors' interpretation, but in my opinion the AUCs may suggest that the neural network was more of an incremental than a 'transformative' improvement for many assays
  • Includes some discussion of interpreting hidden units in the virtual screening domain
  • The study is generally well-executed, even if many of the computational ideas had already appeared in other virtual screening papers. This work could be mentioned but may not be a major focal point.

Other notes

@hmf0103
Copy link

hmf0103 commented Aug 26, 2018

@agitter Hi, do you know where to find deeptox source code?

@agitter
Copy link
Collaborator Author

agitter commented Aug 26, 2018

@hmf0103 in my original notes above I linked to the code at https://github.com/bioinf-jku/binet. It may not be the exact same code used in this paper, but you could ask the authors in the Issues there.

@vinay-hebb
Copy link

vinay-hebb commented Aug 31, 2022

@agitter

I can't find any mention of the code in their paper. How did you infer that they used binet?

@agitter
Copy link
Collaborator Author

agitter commented Aug 31, 2022

@vinay-hebb I no longer remember how I originally associated that repo with the paper. It may have been through the authors' DeepTox website. In bioinf-jku/binet#6 the author also confirmed

DeepTox was done using binet, as well as other libraries (e.g. rdkit and scikit-learn).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants