Skip to content

Latest commit

 

History

History
59 lines (59 loc) · 2.31 KB

2022-06-28-alon22a.md

File metadata and controls

59 lines (59 loc) · 2.31 KB
title booktitle abstract layout series publisher issn id month tex_title firstpage lastpage page order cycles bibtex_author author date address container-title volume genre issued pdf extras
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
Proceedings of the 39th International Conference on Machine Learning
Retrieval-based language models (R-LM) model the probability of natural language text by combining a standard language model (LM) with examples retrieved from an external datastore at test time. While effective, a major bottleneck of using these models in practice is the computationally costly datastore search, which can be performed as frequently as every time step. In this paper, we present RetoMaton - retrieval automaton - which approximates the datastore search, based on (1) saving pointers between consecutive datastore entries, and (2) clustering of entries into "states". This effectively results in a weighted finite automaton built on top of the datastore, instead of representing the datastore as a flat list. The creation of the automaton is unsupervised, and a RetoMaton can be constructed from any text collection: either the original training corpus or from another domain. Traversing this automaton at inference time, in parallel to the LM inference, reduces its perplexity by up to 1.85, or alternatively saves up to 83% of the nearest neighbor searches over $k$NN-LM (Khandelwal et al., 2020) without hurting perplexity. Our code and trained models are available at https://github.com/neulab/retomaton .
inproceedings
Proceedings of Machine Learning Research
PMLR
2640-3498
alon22a
0
Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
468
485
468-485
468
false
Alon, Uri and Xu, Frank and He, Junxian and Sengupta, Sudipta and Roth, Dan and Neubig, Graham
given family
Uri
Alon
given family
Frank
Xu
given family
Junxian
He
given family
Sudipta
Sengupta
given family
Dan
Roth
given family
Graham
Neubig
2022-06-28
Proceedings of the 39th International Conference on Machine Learning
162
inproceedings
date-parts
2022
6
28