The code for ACL2020 paper "Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization"
- pytorch >= 1.0
- python >= 3.6
The datasets used in this paper can be found here.
-
You should first obtain the word alignment by the fastalign tool. There is an example in "ATS-NE/beaver/data/savefile/c2e.pkl".
-
Build vocab
cd ATS-NE/tools && python build_vocab $number < $textfile > $vocab_file
- Build the translation candidates and their index.
cd .. && python -m beaver.data.post_prob
- Start training
python train.py -config run_config/train.json
- Start decoding
python translate.py -config run_config/decode.json
The summaries generated by our methods as well as their ROUGE score and MoverScore are presented in "resultofpaper" folder. The scripts we use to obtain rouge scores and MoverScore are also included.
We would appreciate your citation if you find this is beneficial.
@inproceedings{zhu-etal-2020-attend,
title = "Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization",
author = "Zhu, Junnan and
Zhou, Yu and
Zhang, Jiajun and
Zong, Chengqing",
booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
month = jul,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.acl-main.121",
pages = "1309--1321",
}
If you have any question, please feel free to contact us by sending an email to {junnan.zhu, yzhou, jjzhang, cqzong}@nlpr.ia.ac.cn.
This project is licensed under the BSD License - see LICENSE.md for details.
The copyright of this code belongs to the authors, and the code is only used for research purposes. Display, reproduction, transmission, distribution or publication of this code is prohibited.