SLMTokBench: Speech Language Model Token Benchmark

Existing speech tokens are not explicitly designed for speech language modeling, and there has been no exploration into their suitability for building speech language models. To address this gap, we build Speech Language Model Token Benchmark (SLMTokBench), to assess the suitability of speech tokens for constructing speech language models. In this benchmark, we evaluate the alignment between speech tokens and text by estimating their mutual information. We assess preservation of speech information within speech tokens by evaluating the quality of resynthesized speech.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SLMTokBench: Speech Language Model Token Benchmark

About

Releases

Packages

0nutation/SLMTokBench

Folders and files

Latest commit

History

Repository files navigation

SLMTokBench: Speech Language Model Token Benchmark

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages