title

booktitle

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval

Proceedings of the 39th International Conference on Machine Learning

Retrieval-based language models (R-LM) model the probability of natural language text by combining a standard language model (LM) with examples retrieved from an external datastore at test time. While effective, a major bottleneck of using these models in practice is the computationally costly datastore search, which can be performed as frequently as every time step. In this paper, we present RetoMaton - retrieval automaton - which approximates the datastore search, based on (1) saving pointers between consecutive datastore entries, and (2) clustering of entries into "states". This effectively results in a weighted finite automaton built on top of the datastore, instead of representing the datastore as a flat list. The creation of the automaton is unsupervised, and a RetoMaton can be constructed from any text collection: either the original training corpus or from another domain. Traversing this automaton at inference time, in parallel to the LM inference, reduces its perplexity by up to 1.85, or alternatively saves up to 83% of the nearest neighbor searches over $k$NN-LM (Khandelwal et al., 2020) without hurting perplexity. Our code and trained models are available at https://github.com/neulab/retomaton .

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

alon22a

0

Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval

468

485

468-485

468

false

Alon, Uri and Xu, Frank and He, Junxian and Sengupta, Sudipta and Roth, Dan and Neubig, Graham

given	family
Uri	Alon

given	family
Frank	Xu

given	family
Junxian	He

given	family
Sudipta	Sengupta

given	family
Dan	Roth

given	family
Graham	Neubig

2022-06-28

Proceedings of the 39th International Conference on Machine Learning

162

inproceedings

date-parts

2022

6

28

https://proceedings.mlr.press/v162/alon22a/alon22a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2022-06-28-alon22a.md

2022-06-28-alon22a.md

Files

2022-06-28-alon22a.md

Latest commit

History

2022-06-28-alon22a.md

File metadata and controls