Release Improved TTS in 7000 Languages · DigitalPhonetics/IMS-Toucan

What's Changed

This release provides new checkpoints and adds improvements that did not make it into the previous release due to time constraints. For more information on the universal TTS model for 7000 languages, please refer to the previous release, v3.0.

Prosody prediction (pitch, energy and durations) is now stochastic: values are sampled from a distribution instead of assuming a one-to-one mapping.
Added support for more IPA modifiers to cover more languages.
Added more languages to the pretraining.
Overhauled the language similarity prediction modules and visualization.
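
To illustrate the difference between a one-to-one mapping and stochastic prosody prediction, here is a minimal sketch. It is not the IMS-Toucan implementation: the function names, the Gaussian distribution and the example values are all assumptions chosen for illustration, and the actual model may use a different kind of distribution.

```python
import random


def predict_deterministic(mean):
    """One-to-one mapping: the same input always yields the same value."""
    return mean


def predict_stochastic(mean, std, rng):
    """Sample a prosody value around the mean (Gaussian is an assumption)."""
    return rng.gauss(mean, std)


# Hypothetical normalized pitch statistics for a single phoneme.
pitch_mean, pitch_std = 1.0, 0.1

rng = random.Random(0)  # seeded so that runs are reproducible
samples = [predict_stochastic(pitch_mean, pitch_std, rng) for _ in range(5)]

print("deterministic:", predict_deterministic(pitch_mean))
print("stochastic:   ", [round(s, 3) for s in samples])
```

With sampling, repeated synthesis of the same text can produce different but plausible prosody. Fixing the seed, or setting the spread to zero in this sketch, recovers repeatable output.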

Full Changelog: v3.0...v3.1
