You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Recently, one-stage TTS models have gained a lot of ground in terms of quality and efficiency. They are vastly better than two stage models.
So, I suggest switching to vits or jets for this add-on. This will vastly simplify the deployment of voices.
One-stage TTS models could be exported to ONNX, and inferenced using onnxruntime. Even better, the inference can be accelerated using Cuda or DirectML without any additional efforts.
Last week, I've started a similar project to make some Vits TTS voices available for Windows, and the initial results are promising. You can check out my work at:
Hi @mush42 , thanks for your interest in this addon! And to be honest, I'm really new at this and I would really like to receive a collaboration from you and other users. They are welcome!
ForwardTacotron also works in real time in Cuda, even on long texts. The question would be to make two distributed versions, or maybe we will have better ideas if we talk about it on Discord.
I am going to read Cristal TTS and I will comment to you, and also, if it is possible to make new voices, I will be willing to do so.
And as for VITS TTS, it is also a good synthesizer, but it also has some disadvantages. For example, it doesn't work on long texts as it causes a minor speech problem, but I know that with your help it could be added as a separate Synth Driver.
I'll be in touch soon!
Hi @rmcpantoja
Recently, one-stage TTS models have gained a lot of ground in terms of quality and efficiency. They are vastly better than two stage models.
So, I suggest switching to vits or jets for this add-on. This will vastly simplify the deployment of voices.
One-stage TTS models could be exported to ONNX, and inferenced using onnxruntime. Even better, the inference can be accelerated using Cuda or DirectML without any additional efforts.
Last week, I've started a similar project to make some Vits TTS voices available for Windows, and the initial results are promising. You can check out my work at:
https://github.com/blindpandas/crystal-tts
If you want to collaborate on this, let's get in touch. I'm @mush42 on Discord.
Best
Musharraf
The text was updated successfully, but these errors were encountered: