Add play and speed to cli options #3027

David-bfg · 2023-10-03T16:12:03Z

A quality-of-life addition for the CLI tool.
Adds a --play argument to play the TTS audio after it is generated.
Also adds a --speed argument for use with Coqui Studio models.

--play uses simpleaudio to play the TTS wav.
--speed <float 0.0-2.0> passes the speed argument to Coqui Studio models.
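
To make the two options concrete, here is a minimal argparse sketch of how such flags could be declared. It is an illustration only, not the code from this PR: the helper names `speed_value` and `build_parser`, the help strings, and the range check are assumptions based on the description above.

```python
import argparse


def speed_value(raw):
    """Parse --speed and reject values outside the 0.0-2.0 range described above."""
    value = float(raw)  # a ValueError here is reported by argparse as a usage error
    if not 0.0 <= value <= 2.0:
        raise argparse.ArgumentTypeError(
            f"speed must be between 0.0 and 2.0, got {value}"
        )
    return value


def build_parser():
    """Hypothetical subset of the CLI options; not the real tts parser."""
    parser = argparse.ArgumentParser(description="Sketch of the proposed options")
    parser.add_argument("--text", type=str, default=None, help="Text to synthesize.")
    parser.add_argument(
        "--play",
        action="store_true",
        help="Play the generated wav after synthesis.",
    )
    parser.add_argument(
        "--speed",
        type=speed_value,
        default=None,
        help="Speed factor (0.0-2.0) forwarded to models that support it.",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

With this sketch, an out-of-range value such as `--speed 3.0` exits with a usage error instead of being passed on to the model.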

CLAassistant · 2023-10-03T16:12:12Z

All committers have signed the CLA.

erogol · 2023-10-06T10:08:43Z

Thanks for the PR @David-bfg

Would that work on different OSs? On Linux you could just pipe to play commands I guess. It'd be better than introducing a new dependency.

David-bfg · 2023-10-06T13:35:51Z

> Would that work on different OSs?

https://simpleaudio.readthedocs.io/en/latest/
simpleaudio is cross-platform: Windows, Linux, and macOS.

> you could just pipe to play commands I guess.

Yes, I think it would be preferable to just add a --pipe_out arg.

Most or all of the logs are print calls, so they would need to be suppressed for piping to work. I was not aware of a way to do that beyond wrapping each log in an if statement, but a cursory Google search suggests there is something more manageable.

I'll look into it and circle back. I'm just hoping there is a function for writing the raw wav data to stdout that is as simple as the one for saving it to a file.

Considering a conversion to pipe wav data to another program, such as aplay, for audio playback. This is incomplete code, pushed to get feedback before proceeding with the implementation.

David-bfg · 2023-10-06T23:18:39Z

TTS/bin/synthesize.py

+    pipe_out = sys.stdout if args.pipe_out else None
+
+    with contextlib.redirect_stdout(None if args.pipe_out else sys.stdout):
+        # Late-import to make things load faster


The rest is just indentation changes on lines 368-417 and 420-531. Use "Hide whitespace" to see the changes better.
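
The idea behind the snippet above can be sketched in isolation: keep a handle to the output stream, silence `print` logging with `contextlib.redirect_stdout(None)`, and write only the wav bytes to the pipe. The following is a minimal, self-contained illustration, not the code from this PR. The function name `synthesize_to_pipe` is hypothetical, a block of silence stands in for model output, and the stdlib `wave` module is swapped in for the project's own wav-saving code.

```python
import contextlib
import io
import sys
import wave


def synthesize_to_pipe(pipe_out, num_frames=800, sample_rate=16000):
    """Hypothetical stand-in for the CLI body.

    When pipe_out is given, print-based logs are suppressed and the wav
    bytes are written to pipe_out. Returns the wav bytes in either case.
    """
    # With sys.stdout set to None, print() silently does nothing.
    target = None if pipe_out is not None else sys.stdout
    with contextlib.redirect_stdout(target):
        print("loading model...")  # swallowed when piping
        buffer = io.BytesIO()
        with wave.open(buffer, "wb") as wav_file:
            wav_file.setnchannels(1)      # mono
            wav_file.setsampwidth(2)      # 16-bit samples
            wav_file.setframerate(sample_rate)
            wav_file.writeframes(b"\x00\x00" * num_frames)  # silence
    wav_bytes = buffer.getvalue()
    if pipe_out is not None:
        pipe_out.write(wav_bytes)  # pipe_out must accept bytes
    return wav_bytes


if __name__ == "__main__":
    # sys.stdout.buffer is the binary view of stdout, suitable for piping.
    synthesize_to_pipe(sys.stdout.buffer)
```

Saved as a script (the file name is up to you), this could be run as `python sketch.py | aplay`, assuming aplay is installed. Because the logs never reach stdout, the receiving program sees only wav data.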

David-bfg · 2023-10-06T23:57:47Z

@erogol, could you give any comments on commit f1b1f4a?

I looked into sending the wav file to standard out instead of the logs. This is roughly what I'd be looking to implement. I just wanted to check that the idea looks reasonable before cleaning it up and fully implementing it.
Thanks.

erogol · 2023-10-09T10:06:42Z

@David-bfg I think piping with standard out makes sense. But we should drop the play argument to keep things simpler.

Removed play and the simpleaudio dependency in favor of pipe functionality, which allows passing wav file data to a program dedicated to playing audio.

David-bfg · 2023-10-10T01:10:16Z

@erogol, unless there are further code comments, this should be complete or at its last stage.

erogol · 2023-10-13T10:46:25Z

@David-bfg I'll review it next Monday. Thanks for the update.

omega3 · 2024-01-21T14:05:03Z

I installed via
pip install TTS
and
tts --text "companies seeking competitive advantage." --model_name "tts_models/en/vctk/vits" --speed 1.5 --out_path "$out" --speaker_idx p230

shows
tts: error: unrecognized arguments: --speed 1.5

David-bfg · 2024-01-21T17:25:02Z

@omega3 https://pypi.org/project/TTS/ and the latest README show that speed has been removed from the docs. It only worked with the ⓍTTS voice model, if I recall.

David-bfg added 2 commits October 3, 2023 10:45

add add cli options for play and speed

7de0455

remove simpleaudio not referenced in file

2258e4d

David-bfg added 3 commits October 3, 2023 11:27

fix simpleaudio dependency version

445920e

add ALSA headers for simpleaudio compilation

f34b45a

Dockerfile ALSA headers for simpleaudio

403ae73

base changes to use stdout instead of play audio

f1b1f4a

David-bfg and others added 12 commits October 9, 2023 13:30

remove play for pipe_out arg that suppresses stdout

b0cbbbb

scipy.io.wavfile.write fails with /dev/null target

0f6fb0f

Streaming inference for XTTS 🚀 (coqui-ai#3035)

098fa07

v0.17.7

ce51251

Redownload XTTS with the local and remote config do not match

4717d8c

Remove unused method

6d8063c

Print a message when it is already donwloaded

a173415

Try-except to present error when the user dont have connection

1210aba

Fix style

a7cc0fa

0.17.8

63061e4

v0.17.8

b1d7591

Merge branch 'coqui-ai:dev' into add_play_and_speed_to_cli_options

29baa4a

erogol merged commit a151d70 into coqui-ai:dev Oct 16, 2023
48 checks passed
