This directory contains the code and data used for the experiments prensented in the paper Short-Term Meaning Shift: A Distributional Exploration, included in the Proceedings of NAACL-HLT 2019.
The final paper is in ./Paper/
, while the slides used for short presentation at NAACL-HLT 2019 are in ./Slides/
.
We release the following material:
- Our implementation of the model introduced by Kim et. al (2014) for creating time dependent word representations, see
./Model/
. - The dataset annotated by users of the subbredit r/LiverpoolFC that we used for our experiments, see
./Dataset/
.
To run the model, Python 3 is required. You can install all the required packages using the following command:
$ pip3 install -r requirements.txt
To create word embeddings without any kind of pre-initialization use the following command:
python3 create_vectors.py -data <data_file> -output <output_directory>
In order to initialize the word embeddings for time bin t with those in t-1 use this command:
python3 create_vectors_preinitialization.py -data_t <data_file_for_time_t> -data_t_minus_1 <data_file_for_time_t-1> -vectors <vectors computed for t-1> -output <output_directory>
For both files, it is possible to pass as arguments the hyper-parameters of the model.
The dataset is provided in the in two files:
./Dataset/LiverpoolFC_[13|17].txt.zip
: the linguistic content produced by the members of r/LiverpoolFC in the two time bins 2011/13 and 2017../Dataset/contexts.txt
: the examples showed to the redditors. For each word, the contexts of use in the two time bins../Dataset/annotated_words.csv
: the annotation by the redditors. For each line: id, target word, shift index value, annotation provided by redditors: [change|no_change|not_available(NA)]
If you use this code or dataset, please cite the following paper:
@inproceedings{del2019short-term,
Author = {Del Tredici, Marco and Fern\'andez, Raquel and Boleda, Gemma},
Booktitle = {Proceedings of NAACL-HLT 2019 (Annual Conference of the North American Chapter of the Association for Computational Linguistics)},
Title = {{Short-Term Meaning Shift: A Distributional Exploration}},
Year = {2019}
}