Correlation Networks for Extreme Multi-label Text Classification

Requirements
- python==3.6.3
- pytorch==1.2.0
- torchgpipe==0.0.5
- click==7.0
- ruamel.yaml==0.16.5
- numpy==1.16.2
- scipy==1.2.1
- scikit-learn==0.20.3
- gensim==3.7.2
- nltk==3.2.4
- tqdm==4.31.1
- joblib==0.13.2
- logzero==1.5.0
Pretrained word embeddings in gensim format
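For reference, here is a minimal sketch of loading embeddings stored in gensim format and assembling an embedding matrix for a vocabulary. The file path, vocabulary, and OOV handling below are illustrative assumptions, not taken from this repository.

```python
# Minimal sketch: load word embeddings saved in gensim format and build an
# embedding matrix for a fixed vocabulary. Paths and vocabulary are hypothetical.
import numpy as np
from gensim.models import KeyedVectors

vectors = KeyedVectors.load("data/glove.840B.300d.gensim", mmap="r")  # hypothetical path
vocab = ["label", "classification", "network"]                        # hypothetical vocabulary

emb_dim = vectors.vector_size
emb_matrix = np.zeros((len(vocab) + 1, emb_dim), dtype=np.float32)    # row 0 reserved for padding
for i, word in enumerate(vocab, start=1):
    if word in vectors:
        emb_matrix[i] = vectors[word]
    else:
        emb_matrix[i] = np.random.uniform(-0.25, 0.25, emb_dim)       # random init for OOV words

print(emb_matrix.shape)
```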
Preprocess

The EUR-Lex dataset is already tokenized, so run

./scripts/preprocess_eurlex.sh

The other datasets first need to be tokenized with NLTK, so run

./scripts/preprocess_others.sh
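As a rough illustration of the NLTK-based tokenization step (the actual preprocessing scripts may lowercase, filter, or truncate text differently), a hedged sketch:

```python
# Rough sketch of NLTK word tokenization for the raw-text datasets.
# The exact cleaning rules used by the preprocessing scripts may differ.
import nltk
from nltk.tokenize import word_tokenize

nltk.download("punkt", quiet=True)  # tokenizer models required by word_tokenize

def tokenize(text: str) -> list:
    # Lowercase and split into word tokens; a common choice for XMTC preprocessing.
    return [token.lower() for token in word_tokenize(text)]

print(tokenize("Correlation networks improve extreme multi-label text classification."))
```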
Train and evaluate

./scripts/run_models.sh
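Results in extreme multi-label classification are typically reported as precision at k (P@1/P@3/P@5). Below is a generic numpy sketch of that metric for orientation; it is not taken from this repository's evaluation code.

```python
# Generic precision@k for multi-label predictions (not this repo's evaluation code).
# scores: (n_samples, n_labels) predicted scores; targets: binary relevance matrix.
import numpy as np

def precision_at_k(scores: np.ndarray, targets: np.ndarray, k: int = 5) -> float:
    top_k = np.argsort(-scores, axis=1)[:, :k]          # indices of the k highest-scoring labels
    hits = np.take_along_axis(targets, top_k, axis=1)   # 1 where a top-k label is relevant
    return float(hits.sum(axis=1).mean() / k)

scores = np.array([[0.9, 0.2, 0.7], [0.1, 0.8, 0.3]])
targets = np.array([[1, 0, 1], [0, 1, 1]])
print(precision_at_k(scores, targets, k=2))  # 1.0 for this toy example
```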
The code for the baseline models is adapted from the following repositories: XML-CNN, BERT, MeSHProbeNet, and AttentionXML.