- mha-implementations.ipynb contains and compares different implementations of multi-head attention
The figures below summarize the performance benchmarks (lower is better).
The figures below summarize the performance benchmarks (lower is better).