Mining Entity Synonyms with Efficient Neural Set Generation

This repo includes the datasets, model training scripts, and model evaluation scripts used in the paper "Mining Entity Synonyms with Efficient Neural Set Generation".

Details about the SynSetMine model can be accessed here. This implementation is based on the PyTorch library.

The documentation will be available here.

Installation

Simply clone this repository via

git clone https://github.com/mickeystroller/SynSetMine-pytorch.git
cd SynSetMine-pytorch

Check whether the dependencies listed below are satisfied. If not, install them via

pip install -r requirements_full.txt

Training Model

You can train the SynSetMine model and test its performance using the commands in run.sh:

chmod +x run.sh
./run.sh

By default, the script runs on the NYT dataset. To run on the other two datasets, uncomment the corresponding code in run.sh.

Model snapshots are saved in the ./snapshots/ directory, logs in the ./runs/ directory, and final results in the ./results/ directory.
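To inspect what a run produced, a minimal Python sketch along the following lines could list the contents of the three output directories named above. The helper name summarize_outputs is hypothetical and not part of this repo, and no assumption is made about the file names inside each directory.

```python
from pathlib import Path

# Directory names taken from the paragraph above.
OUTPUT_DIRS = ["snapshots", "runs", "results"]


def summarize_outputs(root="."):
    """Map each output directory to the sorted names of its entries.

    A directory that does not exist yet (for example, before the first
    training run) is mapped to None. This helper is illustrative only.
    """
    summary = {}
    for name in OUTPUT_DIRS:
        directory = Path(root) / name
        if directory.is_dir():
            summary[name] = sorted(p.name for p in directory.iterdir())
        else:
            summary[name] = None
    return summary


if __name__ == "__main__":
    for name, entries in summarize_outputs().items():
        if entries is None:
            print(f"./{name}/ does not exist yet")
        else:
            print(f"./{name}/ contains {len(entries)} entries")
```

Run it from the repository root after ./run.sh finishes, or pass another path as root.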

Loading Pre-trained Model for Prediction

We provide three pre-trained models, one for each dataset, in the ./snapshots/ directory. You can load them directly for prediction via:

chmod +x predict.sh
./predict.sh

Dependencies

Python 3 with NumPy
PyTorch > 0.4.0
scikit-learn (imported as sklearn)
tensorboardX (to display and log information while the model is running)
gensim (to load embedding files)
tqdm (to display progress information while the model is running)
networkx (to calculate one particular evaluation metric)
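As a quick way to check the list above, the following Python sketch reports which modules cannot be found and whether the installed PyTorch satisfies the > 0.4.0 requirement. The function names find_missing and version_tuple are hypothetical helpers, not part of this repo. The import names (torch for PyTorch, sklearn for scikit-learn) are the standard ones for those packages.

```python
import importlib.util

# Import names for the dependencies listed above.
REQUIRED_MODULES = ["numpy", "torch", "sklearn", "tensorboardX",
                    "gensim", "tqdm", "networkx"]


def find_missing(modules):
    """Return the top-level module names that cannot be located for import."""
    missing = []
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is None:
            missing.append(name)
    return missing


def version_tuple(version):
    """Convert a version string such as '1.13.1+cu117' to a tuple of ints."""
    parts = []
    # Drop any local suffix after '+', then keep the leading digits of each piece.
    for piece in version.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found.")
    if "torch" not in missing:
        import torch
        ok = version_tuple(torch.__version__) > (0, 4, 0)
        print("PyTorch", torch.__version__,
              "satisfies > 0.4.0" if ok else "does not satisfy > 0.4.0")
```

If any module is reported missing, installing from requirements_full.txt as described in the Installation section should resolve it.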

Screenshot

References

If you find this code useful for your research, please cite the following paper in your publication:

@inproceedings{Shen2019SynSetMine,
  title={Mining Entity Synonyms with Efficient Neural Set Generation},
  author={Jiaming Shen and Ruiliang Lv and Xiang Ren and Michelle Vanni and Brian Sadler and Jiawei Han},
  booktitle={AAAI},
  year={2019}
}
