Add TTS with VITS #360

csukuangfj · 2023-10-12T12:34:47Z

No description provided.

csukuangfj · 2023-10-13T11:25:00Z

Usage of this PR

Build sherpa-onnx with this PR

mkdir build
cd build
cmake ..
make -j

The build produces bin/sherpa-onnx-offline-tts:

$ ls -lh bin/sherpa-onnx-offline-tts
-rwxr-xr-x  1 fangjun  staff    36K Oct 13 19:17 bin/sherpa-onnx-offline-tts

Download the model files

wget https://huggingface.co/csukuangfj/vits-ljs/resolve/main/vits-ljs.onnx
wget https://huggingface.co/csukuangfj/vits-ljs/resolve/main/lexicon.txt
wget https://huggingface.co/csukuangfj/vits-ljs/resolve/main/tokens.txt
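
If wget is not installed, the same three files can be fetched from Python. This is only a sketch: the helper names `model_urls` and `download_missing` are made up for illustration, and the URLs are exactly the ones listed above.

```python
import os
import urllib.request

# Base URL and file names are copied from the wget commands above.
BASE_URL = "https://huggingface.co/csukuangfj/vits-ljs/resolve/main"
MODEL_FILES = ("vits-ljs.onnx", "lexicon.txt", "tokens.txt")


def model_urls(base_url=BASE_URL, files=MODEL_FILES):
    """Return the download URL of every model file (hypothetical helper)."""
    return [f"{base_url}/{name}" for name in files]


def download_missing(dest_dir="."):
    """Download only the files that are not already in dest_dir.

    Returns the list of paths that were actually fetched.
    """
    fetched = []
    for url in model_urls():
        target = os.path.join(dest_dir, url.rsplit("/", 1)[-1])
        if not os.path.exists(target):
            urllib.request.urlretrieve(url, target)
            fetched.append(target)
    return fetched
```

Calling `download_missing()` from the build directory leaves the files where the run command below expects them.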

Run the model

./bin/sherpa-onnx-offline-tts \
  --vits-model=./vits-ljs.onnx \
  --vits-lexicon=./lexicon.txt \
  --vits-tokens=./tokens.txt \
  'Success is not final, failure is not fatal, it is the courage to continue that counts!'
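
For scripting, the command above can be assembled as an argument list and run with `subprocess`, which avoids shell quoting problems when the text contains apostrophes. This is a sketch under assumptions: `build_tts_command` and `synthesize` are hypothetical helper names, and only the three flags shown above are used.

```python
import os
import subprocess


def build_tts_command(text, model_dir=".", binary="./bin/sherpa-onnx-offline-tts"):
    """Build the argv list for the TTS binary (hypothetical helper).

    Only the flags shown in the command above are used.
    """
    return [
        binary,
        f"--vits-model={model_dir}/vits-ljs.onnx",
        f"--vits-lexicon={model_dir}/lexicon.txt",
        f"--vits-tokens={model_dir}/tokens.txt",
        text,
    ]


def synthesize(text, model_dir="."):
    """Run the binary after checking that the model files exist."""
    cmd = build_tts_command(text, model_dir)
    paths = [arg.split("=", 1)[1] for arg in cmd[1:4]]
    missing = [p for p in paths if not os.path.exists(p)]
    if missing:
        raise FileNotFoundError(f"missing model files: {missing}")
    # A list argv is passed straight to the program, so no shell quoting is needed.
    subprocess.run(cmd, check=True)
```

Run `synthesize("Hello")` from the build directory so that the relative path to the binary resolves.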

This writes the synthesized audio to t.pcm in the current directory:

$ ls -lh t.pcm
-rw-r--r--  1 fangjun  staff   489K Oct 13 19:16 t.pcm

t.pcm is headerless, so sox has to be told the format: mono, 32-bit floating-point samples at 22050 Hz. Use the following command to convert it to a wave file:

sox -t raw -r 22050 -b 32 -e floating-point -c 1 ./t.pcm ./1.wav
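
If sox is not available, a rough equivalent can be written with the Python standard library. Note the differences from the sox command: the `wave` module only writes integer PCM, so this sketch converts the samples to 16-bit integers instead of keeping 32-bit floats, and it assumes the raw file is little-endian. The function name `pcm_f32_to_wav` is made up for illustration.

```python
import array
import sys
import wave


def pcm_f32_to_wav(pcm_path, wav_path, sample_rate=22050):
    """Convert headerless mono float32 PCM to a 16-bit PCM WAV file.

    Assumes little-endian float32 samples in [-1, 1]; values outside
    that range are clipped. Returns the number of samples written.
    """
    with open(pcm_path, "rb") as f:
        raw = f.read()

    samples = array.array("f")
    # Ignore a trailing partial sample, if any.
    samples.frombytes(raw[: len(raw) - len(raw) % 4])
    if sys.byteorder == "big":
        samples.byteswap()  # the file is assumed to be little-endian

    # Clip to [-1, 1] and scale to the signed 16-bit range.
    ints = array.array(
        "h", (int(max(-1.0, min(1.0, s)) * 32767) for s in samples)
    )

    with wave.open(wav_path, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)  # 2 bytes = 16-bit samples
        w.setframerate(sample_rate)
        w.writeframes(ints.tobytes())
    return len(samples)
```

For example, `pcm_f32_to_wav("t.pcm", "1.wav")` writes a file about half the size of the one sox produces, because each sample takes 2 bytes instead of 4.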

The resulting file looks like this:

$ ls -lh 1.wav
-rw-r--r--  1 fangjun  staff   489K Oct 13 19:16 1.wav

$ soxi 1.wav

Input File     : '1.wav'
Channels       : 1
Sample Rate    : 22050
Precision      : 25-bit
Duration       : 00:00:05.68 = 125184 samples ~ 425.796 CDDA sectors
File Size      : 501k
Bit Rate       : 706k
Sample Encoding: 32-bit Floating Point PCM
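
The numbers reported by soxi and ls can be cross-checked with a little arithmetic. The sketch below uses only the sample count, sample rate, and sample width shown above; the variable names are mine.

```python
SAMPLE_RATE = 22050      # Hz, from the sox command and soxi output
NUM_SAMPLES = 125184     # from the soxi "Duration" line
BYTES_PER_SAMPLE = 4     # 32-bit floating point

duration_s = NUM_SAMPLES / SAMPLE_RATE
data_bytes = NUM_SAMPLES * BYTES_PER_SAMPLE
bit_rate = SAMPLE_RATE * BYTES_PER_SAMPLE * 8
cdda_sectors = duration_s * 75  # a CDDA sector is 1/75 of a second

print(f"duration : {duration_s:.2f} s")              # 5.68 s
print(f"raw data : {data_bytes} bytes")              # 500736 bytes
print(f"         = {data_bytes / 1024:.0f} KiB")     # 489 KiB
print(f"bit rate : {bit_rate} bit/s")                # 705600 bit/s
print(f"sectors  : {cdda_sectors:.3f}")              # 425.796
```

The 500736 bytes of sample data are exactly 489 KiB, which matches the size ls reports for t.pcm. The wave file only adds a small header, so ls still shows 489K for 1.wav, and 705600 bit/s rounds to the 706k that soxi prints.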

I have converted 1.wav to 1.mov and posted it here so you can listen to it.

1.mov

csukuangfj · 2023-10-13T11:26:33Z

csukuangfj added 6 commits October 12, 2023 20:33

1f66665  Begin to add TTS with VITS
4d27160  rename
222580c  add offline tts vits model
65d338b  first working version
0f1c9d9  Add lexicon
017e471  Add a tts example

csukuangfj changed the title from "WIP: Begin to add TTS with VITS" to "Add TTS with VITS" on Oct 13, 2023

csukuangfj merged commit 536d580 into k2-fsa:master Oct 13, 2023
133 of 144 checks passed

csukuangfj deleted the vits branch October 13, 2023 11:30
