A database of the Knowledge Diffusion for Neural Dialogue Generation.
The files are in json and the encoding format is UTF8.
KB for movies and celebrities
--id
: unique ID for each movie
--title
: title of the movie
--director
: a list of celebrityIDs who direct the movie
--actor
: a list of celebrityIDs who act the movie
--duration
: movie duration in minutes
--release_time
: movie release year
--comments
: a list of 5 comments on the movie
--id
: unique ID for each celebrity
--name
: name of the celebrity
Conversations grounded on knowledge base
--raw_sentence
: raw utterance
--movie
: a list of movie and corresponding movieID appear in raw_sentence for similar knowledge based conversations
--celebrity
: a list of celebrity and corresponding celebrityID appear in raw_sentence for similar knowledge based conversations
--triple
: a list of facts(subjectID, relation, objectID) for revelant knowledge based conversations
We appreciate your citation if you find our dataset is beneficial.
@inproceedings{liu2018NKD,
title={Knowledge Diffusion for Neural Dialogue Generation},
author={Liu, Shuman and Chen, Hongshen and Ren, Zhaochun and Feng, Yang and Liu, Qun and Yin, Dawei},
booktitle={Proceedings of the 56 th Annual Meeting of the Association for Computational Linguistics(Volume 1: Long Papers)},
pages={1489--1498},
year={2018},
publisher = {Association for Computational Linguistics},
}