Release v0.4.0 #60

Merged
merged 1 commit into main from kmaziarz/release-0.4.0 on Jun 16, 2023

Conversation

kmaziarz
Collaborator

This PR edits the CHANGELOG to mark the v0.4.0 release. After it's merged, I will tag the resulting commit, and then push the release to PyPI.

In particular, this release includes several changes that make molecule_generation compatible with the latest versions of rdkit and numpy.

kmaziarz merged commit d243e6a into main on Jun 16, 2023
5 checks passed
kmaziarz deleted the kmaziarz/release-0.4.0 branch on Jun 16, 2023 at 17:17
@thk9178

thk9178 commented Jun 30, 2023

Thank you for publishing your great work.
In addition to the VAE itself, I would like to train on my SMILES-property data in a supervised way; is there a manual for doing this with the program?
If not, how can I train it with my property data?

@kmaziarz
Collaborator Author

Thank you for your question @thk9178, and sorry for the slow response. This package only provides the code to train the VAE, which would typically be done in a property-agnostic way. If you have some property data, there are essentially two routes:

  • Train a separate supervised property predictor, and use it to perform optimization in the latent space of a trained MoLeR model (for the optimization part itself you could use things like Bayesian Optimisation, Molecular Swarm Optimisation, Genetic Algorithms, etc).
  • Take MoLeR trained in a property-agnostic way and then fine-tune it further on samples with high property values. Then "property optimization" can be performed by sampling from the prior directly and ranking the results. We haven't explored this direction much though.

If you take the first route, then you need to train a supervised property prediction model, for which you'd have to look elsewhere. A simple starting point would be to just train a shallow MLP on molecular fingerprints.
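
For concreteness, here is a rough sketch of what the first route could look like: a shallow scikit-learn MLP on RDKit Morgan fingerprints as the property predictor, plus a naive random search around some seed latents. The `load_model_from_directory` / `encode` / `decode` calls follow this package's README; the dataset, model directory and hyperparameters below are just placeholders you would swap for your own.

```python
import numpy as np
from rdkit import Chem
from rdkit.Chem import AllChem
from sklearn.neural_network import MLPRegressor

from molecule_generation import load_model_from_directory


def featurize(smiles_list):
    """Morgan fingerprints as a simple input representation for the predictor."""
    fingerprints = []
    for smiles in smiles_list:
        mol = Chem.MolFromSmiles(smiles)
        fp = AllChem.GetMorganFingerprintAsBitVect(mol, radius=2, nBits=2048)
        fingerprints.append(np.array(list(fp), dtype=np.float32))
    return np.stack(fingerprints)


# Step 1: train a shallow MLP property predictor on your own SMILES-property data.
train_smiles = ["c1ccccc1", "CNC=O"]  # placeholder: your SMILES
train_properties = [0.1, 0.7]         # placeholder: your property values
predictor = MLPRegressor(hidden_layer_sizes=(256,), max_iter=500)
predictor.fit(featurize(train_smiles), train_properties)

# Step 2: search the latent space of a trained MoLeR model, scoring with the predictor.
model_dir = "./example_model_directory"  # placeholder: directory with a trained checkpoint
seed_smiles = ["c1ccccc1"]               # placeholder: starting molecules

with load_model_from_directory(model_dir) as model:
    seed_latent = model.encode(seed_smiles)[0]
    # Naive "optimization": perturb the seed latent with Gaussian noise and decode.
    candidate_latents = [seed_latent + 0.2 * np.random.randn(*seed_latent.shape)
                         for _ in range(100)]
    candidate_smiles = model.decode(candidate_latents)

# Step 3: rank the decoded candidates by predicted property value.
valid = [s for s in candidate_smiles if s and Chem.MolFromSmiles(s) is not None]
scores = predictor.predict(featurize(valid))
for score, smiles in sorted(zip(scores, valid), reverse=True)[:10]:
    print(f"{score:.3f}\t{smiles}")
```

In practice you would replace the random perturbations with a proper optimizer (e.g. Bayesian optimisation over the latent space), but the overall plumbing stays the same: encode, propose latents, decode, score with the predictor, and keep the best candidates.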

@thk9178

thk9178 commented Jul 14, 2023

I'm so grateful for your help. I can definitely do that as you recommended.
I'll have to pray that MoLeR's latent space shows a smooth gradient for my property data. 😁
Thank you very much for your kind reply.
