How Transformer Speak: An Interacting Multi-Particle Perspective

The Transformer neural network architecture is a cornerstone of many modern state-of-the-art AI systems, from large language models for text generation to image segmentation for autonomous vehicles. Still, little is known about the inner working principles of Transformers and how to interpret them. We take one step towards opening the black box with a series of empirical evaluations. First, we demonstrate that in the latent space of a Transformer model the tokens cluster over time, indicating a kind of consensus dynamics. Second, we draw a connection to clustered federated consensus-based optimisation, which affords the interpretation of tokens cooperating in groups to evolve towards a consensus point that is most relevant to the group. Our work provides stepping stones for further discoveries that benefit the explainability and trustability of Transformer-based AI applications.

Visualisations

Refer to the report for more details. Example plots below.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
visualisation		visualisation
.gitattributes		.gitattributes
README.md		README.md
experimentation.py		experimentation.py
heatmap01-24.png		heatmap01-24.png
heatmap25-48.png		heatmap25-48.png
notebook.ipynb		notebook.ipynb
plotting.py		plotting.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How Transformer Speak: An Interacting Multi-Particle Perspective

Visualisations

About

Releases

Packages

Languages

nkirschi/How-Transformers-Speak

Folders and files

Latest commit

History

Repository files navigation

How Transformer Speak: An Interacting Multi-Particle Perspective

Visualisations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages