Name		Name	Last commit message	Last commit date
parent directory ..
checkpoints		checkpoints
data		data
datasets		datasets
README.md		README.md
eval.py		eval.py
graphCNF.py		graphCNF.py
graph_node_edge_coupling.py		graph_node_edge_coupling.py
mutils.py		mutils.py
task.py		task.py
train.py		train.py

README.md

Molecule generation

This experiment folder summarizes the experiments on molecule generation.

Datasets

We provide scripts to run experiments on the two datasets Zinc250k and Moses. Before running any experiments, please download the preprocessed datasets, and place them inside the data/ folder (i.e. data/moses/ and data/zinc250k/). In case you only want to run an experiment on one of the two datasets, it is sufficient to download the respective folder and ignore the other dataset.

Example molecules of both datasets can be seen below. Both dataset files contain code for visualizing and plotting molecules.

Zinc250k	Moses

Training

To train GraphCNF on Zinc250k, use the following command:

python train.py --dataset zinc250k \
                --max_iterations 150000 \
                --batch_size 64 \
                --encoding_dim_nodes 6 \
                --encoding_dim_edges 2 \
                --optimizer 4 \
                --learning_rate 5e-4 \
                --checkpoint_path checkpoints/GraphCNF_zinc250k/ \
                --cluster

For training the model on Moses, replace --dataset zinc250k with --dataset moses and increase the batch size 96. The --cluster argument is set to reduce the printed output to stdout for longer experiments.

Evaluation

GraphCNF can be evaluated by:

python eval.py --checkpoint_path path_to_folder

where path_to_folder should be replaced with the path to the actual folder with the checkpoints. The evaluation script applies the saved model to the test set and samples 10k new molecule examples from the model. The final performance on all metrics is saved in the file eval_metrics.json.

Pretrained models

Pretrained models of GraphCNF can be found here. After downloading the model folders, place them inside the checkpoints folder (for instance, checkpoints/GraphCNF. You can evaluate the pretrained models by running the evaluation script:

python eval.py --checkpoint_path checkpoints/GraphCNF/

Results

Zinc250k

Model	Validity	Uniqueness	Novelty	Bits per node
GraphCNF (pretrained)	83.41%	99.99%	100%	5.27bpd
+ Subgraphs	96.35%	99.98%	100%

Moses

Model	Validity	Uniqueness	Novelty	Bits per node
GraphCNF (pretrained)	82.56%	100%	100%	4.94bpd
+ Subgraphs	95.66%	99.98%	100%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

molecule_generation

molecule_generation

README.md

Molecule generation

Datasets

Training

Evaluation

Pretrained models

Results

Zinc250k

Moses

Example generations

Files

molecule_generation

Directory actions

More options

Directory actions

More options

Latest commit

History

molecule_generation

Folders and files

parent directory

README.md

Molecule generation

Datasets

Training

Evaluation

Pretrained models

Results

Zinc250k

Moses

Example generations