GitHub - pdufter/densray: Getting interpretable dimensions in word embedding spaces.

Introduction

This repository compares different methods of obtaining interpretable dimension in word embedding spaces.

More specifically it compares:

Densifier
DensRay: A method closely related to Densifier, but computable in closed form.
Support Vector Machines / Regression
Linear / Logistic Regression.

The evaluation tasks are lexicon induction and set-based word analogy.

For more details see the Paper.

Note that this repo does not include an implementation of the Densifier, but relies on the original Matlab implementation by the authors of Densifier.

Usage

For an example how to use the code see example.sh.

References

If you use the code, please cite

@article{dufter2019analytical,
  title={Analytical Methods for Interpretable Ultradense Word Embeddings},
  author={Dufter, Philipp and Sch{\"u}tze, Hinrich},
  journal={arXiv preprint arXiv:1904.08654},
  year={2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
analogy		analogy
data		data
debiasing		debiasing
evaluation		evaluation
lexind		lexind
model		model
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
example.sh		example.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Usage

References

About

Releases

Packages

Languages

License

pdufter/densray

Folders and files

Latest commit

History

Repository files navigation

Introduction

Usage

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages