StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
-
Updated
Aug 10, 2024 - Python
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
This repository has implementation for "Neural Voice Cloning With Few Samples"
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
PyTorch implementation of Densely Connected Time Delay Neural Network
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
Implementation of the paper "Speaker Adaptive Training for Speech Recognition Based on Attention-over-Attention Mechanism" in Pytorch.
Add a description, image, and links to the speaker-adaptation topic page so that developers can more easily learn about it.
To associate your repository with the speaker-adaptation topic, visit your repo's landing page and select "manage topics."