#

speaker-adaptation

Here are 9 public repositories matching this topic...

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Aug 10, 2024
Python

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

deep-learning voice tts speech-processing voice-synthesis saidl speaker-adaptation voice-cloning speaker-encodings mel-spectogram

Updated Feb 23, 2021
Python

keonlee9420 / StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

text-to-speech style pytorch tts speech-synthesis english speaker prosody meta-learning one-shot speaker-adaptation neural-tts non-autoregressive fastspeech speech-style stylespeech meta-stylespeech unseen-speaker

Updated Feb 10, 2022
Python

yuyq96 / D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

speech speaker-recognition speaker-verification speaker-diarization time-delay-neural-network speaker-embedding speaker-adaptation temporal-convolutional-network d-tdnn

Updated May 4, 2023
Python

cvqluu / dropclass_speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

machine-learning dropout kaldi representation-learning speaker-recognition speaker-verification meta-learning speaker-identification metalearning speaker-embedding speaker-adaptation

Updated Oct 29, 2020
Python

VisionBrain / Neural_Voice_Cloning

Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)

deep-learning pytorch artificial-intelligence speech-synthesis voice-recognition speaker-recognition speech-processing audio-processing voice-synthesis mel-spectrogram speaker-adaptation speaker-encodings aryan05

Updated Oct 12, 2020
Python

awasthiabhijeet / Error-Driven-ASR-Personalization

Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021

speech-recognition automatic-speech-recognition speech-to-text speaker-adaptation accent-adaptation icassp2021 speaker-accent accented-speech

Updated Jun 13, 2021
Python

XIEXurong / kaldi_bayes_adapt

This is a modified version of Kaldi speech recognition toolkit with the codes of standard and Bayesian adaptation approaches, e.g., LHUC, LHN, PAct, etc..

end-to-end speech-recognition bayesian adaptation lf-mmi speaker-adaptation

Updated Nov 16, 2023
Shell

HarunoriKawano / SAT-based-attn-over-attn

Implementation of the paper "Speaker Adaptive Training for Speech Recognition Based on Attention-over-Attention Mechanism" in Pytorch.

pytorch speaker-adaptation

Updated May 19, 2023
Python

Improve this page

Add a description, image, and links to the speaker-adaptation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speaker-adaptation topic, visit your repo's landing page and select "manage topics."