SUGAR

Code for "SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism"

Overview

train.py: the core of the model, including its architecture and the training procedure.
env.py, QLearning.py: the reinforcement learning (RL) component.
GCN.py, layers.py, SAGE.py: the basic layers used by the main model.
dataset/: the datasets: MUTAG, DD, NCI1, NCI109, PTC_MR, ENZYMES, PROTEINS (the download link will be provided later).
- 'RAW/': the original data of the dataset
- adj.npy: the largest adjacency matrix built from the dataset
- graph_label.npy: the label of every subgraph
- sub_adj.npy: the adjacency matrices of the subgraphs obtained through sampling
- features.npy: the preprocessed features of each subgraph
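
Before training, it can help to confirm that the four preprocessed arrays listed above are in place. The following is a minimal sketch, not part of the repository: the helper name `missing_dataset_files` is hypothetical, and the directory passed to it (for example one folder per dataset under dataset/) is an assumption about the layout.

```python
from pathlib import Path

# File names come from the list above; the helper itself is hypothetical.
EXPECTED_FILES = ("adj.npy", "graph_label.npy", "sub_adj.npy", "features.npy")


def missing_dataset_files(dataset_dir):
    """Return the expected .npy files that are absent from dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

An empty list means all four files are present; each of them can then be read with `numpy.load`. How train.py actually consumes the arrays is defined in train.py itself.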

Datasets

MUTAG: The MUTAG dataset consists of 188 chemical compounds divided into two classes according to their mutagenic effect on a bacterium.
D&D: D&D is a dataset of 1178 protein structures (Dobson and Doig, 2003). Each protein is represented by a graph, in which the nodes are amino acids and two nodes are connected by an edge if they are less than 6 Angstroms apart. The prediction task is to classify the protein structures into enzymes and non-enzymes.
NCI1 & NCI109: NCI1 and NCI109 are two balanced subsets of datasets of chemical compounds screened for activity against non-small cell lung cancer and ovarian cancer cell lines, respectively (Wale and Karypis, 2006; http://pubchem.ncbi.nlm.nih.gov).
ENZYMES: ENZYMES is a dataset of protein tertiary structures obtained from Borgwardt et al. (2005), consisting of 600 enzymes from the BRENDA enzyme database (Schomburg et al., 2004). The task is to assign each enzyme to one of the 6 EC top-level classes.
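
The D&D description above connects two amino acids when they are less than 6 Angstroms apart. The sketch below illustrates that rule on toy data only. It assumes one 3D coordinate per node, which the description does not specify, and the function name `contact_graph` and the coordinates are invented for illustration; it is not how the dataset files in this repository were produced.

```python
import math


def contact_graph(coords, cutoff=6.0):
    """Build a symmetric 0/1 adjacency matrix linking points closer than cutoff."""
    n = len(coords)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Strictly "less than" the cutoff, as in the dataset description.
            if math.dist(coords[i], coords[j]) < cutoff:
                adj[i][j] = adj[j][i] = 1
    return adj


# Toy coordinates in Angstroms (made up): nodes 0 and 1 are 3 apart, node 2 is far away.
toy_coords = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
toy_adj = contact_graph(toy_coords)
```

Here only nodes 0 and 1 end up connected, since the other pairs are 7 and 10 Angstroms apart.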

Setting

Create a "dataset" directory (mkdir dataset) and download the datasets into it.
Set up the Python environment.
Run python train.py (all parameters can be viewed in train.py).

Parameters

 --dataset DATASET
 --num_info NUM_INFO
 --lr LR (learning_rate)
 --max_pool MAX_POOL
 --momentum MOMENTUM
 --num_epoch NUM_EPOCH
 --batch_size BATCH_SIZE
 --sg_encoder SG_ENCODER (GIN, GCN, GAT, SAGE)
 --MI_loss MI_LOSS
 --start_k START_K
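
As a rough illustration of how the options above could be declared, here is a minimal argparse sketch. Only the flag names and the four encoder choices come from the list above. The types are assumptions, no defaults are set because none are documented here, and the value "MUTAG" for --dataset is a guess at the accepted spelling. The actual definitions are in train.py.

```python
import argparse


def build_parser():
    """Sketch of a parser for the listed flags; types are assumed, defaults omitted."""
    parser = argparse.ArgumentParser(description="Sketch of the train.py options")
    parser.add_argument("--dataset", type=str)
    parser.add_argument("--lr", type=float)
    parser.add_argument("--momentum", type=float)
    parser.add_argument("--num_epoch", type=int)
    parser.add_argument("--batch_size", type=int)
    parser.add_argument("--sg_encoder", choices=["GIN", "GCN", "GAT", "SAGE"])
    # The meaning of these flags is not documented above, so they stay raw strings.
    for flag in ("--num_info", "--max_pool", "--MI_loss", "--start_k"):
        parser.add_argument(flag)
    return parser


# Example invocation with assumed values.
args = build_parser().parse_args(
    ["--dataset", "MUTAG", "--lr", "0.01", "--sg_encoder", "GCN"]
)
```

Flags that are not passed come back as None in this sketch, which makes it easy to see which options were left to whatever train.py uses by default.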

Reference

@inproceedings{sun2021sugar,
  title={SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism},
  author={Sun, Qingyun and Li, Jianxin and Peng, Hao and Wu, Jia and Ning, Yuanxing and Yu, Phillip S and He, Lifang},
  booktitle={Proceedings of the 2021 World Wide Web Conference},
  year={2021}
}
