PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
text-to-speech deep-neural-networks pytorch tts speech-synthesis generative-model semi-supervised-learning global-style-tokens neural-tts non-autoregressive parallel-tacotron non-ar emotion-transfer cross-speaker conditional-layer-normalization
-
Updated
Nov 9, 2022 - Python