This repository contains the dataset for our EMNLP 2019 paper "Attribute-aware Sequence Network for Review Summarization".

Download our dataset from https://pan.baidu.com/s/1n256L9o3DVoshum65Efo3g using the password "6tkw".

train.txt, test.txt and dev.txt are the training set, testing set and development set, respectively. Each line in each file is one sample. Each line consists of 7 elements, separated by "\t\t". Element 1 is the user ID, element 2 is the overall rating (which is not used in this paper), element 3 is the review content, element 4 is the summary of the review, and elements 5-7 are age, gender and travel state.
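The line format described above can be read with a few lines of Python. This is a minimal sketch, not code from the paper: the names `Sample`, `parse_line` and `read_samples` are hypothetical, the UTF-8 encoding is an assumption, and all fields are kept as raw strings because the value formats are not specified here.

```python
from typing import Iterator, NamedTuple

FIELD_SEP = "\t\t"  # elements are separated by two tab characters
NUM_FIELDS = 7


class Sample(NamedTuple):
    """One line of train.txt / dev.txt / test.txt (hypothetical field names)."""
    user_id: str
    rating: str        # overall rating, not used in the paper
    review: str
    summary: str
    age: str
    gender: str
    travel_state: str


def parse_line(line: str) -> Sample:
    """Split one raw line into its 7 elements."""
    # Strip only the line ending so an empty trailing field is preserved.
    parts = line.rstrip("\r\n").split(FIELD_SEP)
    if len(parts) != NUM_FIELDS:
        raise ValueError(f"expected {NUM_FIELDS} elements, got {len(parts)}")
    return Sample(*parts)


def read_samples(path: str, encoding: str = "utf-8") -> Iterator[Sample]:
    """Yield one Sample per non-empty line. The encoding is an assumption."""
    with open(path, encoding=encoding) as f:
        for line in f:
            if line.strip():
                yield parse_line(line)
```

For example, `for sample in read_samples("train.txt"): print(sample.summary)` would print the summary of each training sample, assuming the file is in the current directory.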