StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
-
Updated
Aug 10, 2024 - Python
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
This repository has implementation for "Neural Voice Cloning With Few Samples"
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
PyTorch implementation of Densely Connected Time Delay Neural Network
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
This is a modified version of Kaldi speech recognition toolkit with the codes of standard and Bayesian adaptation approaches, e.g., LHUC, LHN, PAct, etc..
Implementation of the paper "Speaker Adaptive Training for Speech Recognition Based on Attention-over-Attention Mechanism" in Pytorch.
Add a description, image, and links to the speaker-adaptation topic page so that developers can more easily learn about it.
To associate your repository with the speaker-adaptation topic, visit your repo's landing page and select "manage topics."