This is a reference code for research paper "RRE: A Relevance Relation Extraction Framework for Cross-Domain Recommender System at Alipay" accepted by ICME 2024.
Jiayang Gu, Xovee Xu, Yulu Tian, Yurun Hu, Jiadong Huang, Wenliang Zhong, Fan Zhou, Lianli Gao
RRE: A Relevance Relation Extraction Framework for Cross-Domain Recommender System at Alipay
IEEE International Conference on Multimedia and Expo (ICME), Niagra Falls, Canada, July 15-19, 2024
python>=3.7
torch>=1.6
transformers>=4.15
node2vec
The running or our code involves 3 steps: (1) data processing; (2) model training; and (3) evaluation.
The predictions of our models are in predictions directory. The WN18RR and FB15k-237 datasets are from the KG-BERT repo.
Take WN18RR as an example:
- Step 1: Data preprocessing
bash scripts/preprocess.sh WN18RR
- Step 2: Generate topological information
Unlike the open source dataset, our mini-program & content dataset has clearly defined the level of strength of relevance. So in the topology generation of our dataset, we connect edges except for "no correlation". As for open source datasets, take WN18RR as an example, we measure the semantic meaning of the corresponding 13 relation types, then decides the directions of the edge.
bash scripts/generate_topo.sh
By replacing the right path of your train.txt and test.txt of WN18RR, you can get an embedding file under output path. - Step 3: Model training (<3 hours)
OUTPUT_DIR=./checkpoint/wn18rr/ bash scripts/train_wn.sh
- Step 4: Evaluation (on a trained model)
bash scripts/eval.sh ./checkpoint/wn18rr/model_last.mdl WN18RR
Also, there exist some triggers to control modules used in our model, located in models_no_hr_vector.py
. If you don't want to use momentum learning or topology info, feel free to set FALSE
to them.
@inproceedings{gu2024rre,
title = {RRE: A Relevance Relation Extraction Framework for Cross-Domain Recommender System at Alipay}
author = {Jiayang Gu and Xovee Xu and Yulu Tian and Yurun Hu and Jiadong Huang and Wenliang Zhong and Fan Zhou and Lianli Gao},
booktitle = {IEEE International Conference on Multimedia and Expo},
year = {2024},
address = {Niagra Falls, Canada},
publisher = {IEEE}
}
We would like to thank Wang et al. for their open source code SimKGC, on which our implements are based.