Control the speed of the voice and duration of pause between words/sentences #181

Closed Answered by erew123
kaosbeat asked this question in Q&A

Hi @kaosbeat

The only real way I know of to change the pacing (pauses etc.) with the XTTS model is to use a reference sample WAV spoken at the pace you want. From anecdotal evidence, if the reference sample is the same person speaking more slowly, the generated speech is slower, and it is faster when the reference sample WAV is spoken a bit faster.

Beyond that, removing commas, semicolons etc. will remove pauses.

There is also a generation speed setting, which speeds up the whole of the TTS generation, though I've not played with it much myself.
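
As a rough sketch of the punctuation idea, the text could be cleaned before it is sent for generation. The function name `strip_pause_punctuation` and the default character set are assumptions made for illustration; they are not part of XTTS or this project.

```python
import re

# Characters assumed to cause pauses (commas and semicolons, per the note above).
PAUSE_PUNCTUATION = ",;"


def strip_pause_punctuation(text: str, chars: str = PAUSE_PUNCTUATION) -> str:
    """Remove pause-inducing punctuation and collapse any leftover whitespace."""
    cleaned = text.translate(str.maketrans("", "", chars))
    return re.sub(r"\s+", " ", cleaned).strip()


print(strip_pause_punctuation("Well, this is slow; very slow, indeed."))
```

Sentence-ending punctuation is left alone here, so pauses between sentences are unaffected.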

https://docs.coqui.ai/en/latest/models/xtts.html#inference-parameters

You could manually introduce speed into tts_server.py by adding "speed": 1.6, (or a number of your choosing) by placing it in …
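
A minimal sketch of what adding that key might look like, assuming the generation settings are held in a plain dictionary. The helper `with_speed` and the `base_kwargs` contents are hypothetical placeholders; the actual location in tts_server.py is not shown here.

```python
def with_speed(generation_kwargs: dict, speed: float = 1.6) -> dict:
    """Return a copy of the settings with a "speed" entry added."""
    if speed <= 0:
        raise ValueError("speed must be positive")
    updated = dict(generation_kwargs)  # copy so the original settings stay untouched
    updated["speed"] = speed
    return updated


# Hypothetical existing settings; the real ones in tts_server.py may differ.
base_kwargs = {"temperature": 0.7}
print(with_speed(base_kwargs))
```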

Answer selected by kaosbeat