CVAE-Tacotron

Code for the CVAE-NL model from the Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder paper, available at: https://arxiv.org/abs/2211.03316 Sample site available at: https://dapwner.github.io/CVAE-Tacotron/

Training

First preprocess your data into mel spectrogram .npy arrays with the preprocess.py script Then run CUDA_VISIBLE_DEVICES=X python train.py --dataset L2Arctic

Inference

Once trained, you can run CUDA_VISIBLE_DEVICES=X python synthesize.py --dataset L2Arctic --restore_step [N] --mode [batch/single/sample] --text [TXT] --speaker_id [SPID] --accent [ACC]

Or run synthesize_debug.py in a debugger and figure out!

###Inference modes

"single": takes in a reference audio to be used for both speaker and accent branches

"batch": allows to extract mu and std values for speakers and accents from a passed dataset

"sample": take the extracted mu and std values, choose your speaker with --speaker_id, and your accent with --accent, then synthesize the speech!

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
__pycache__		__pycache__
audio		audio
config/L2Arctic		config/L2Arctic
hifigan		hifigan
model		model
output		output
preprocessed_data/L2Arctic		preprocessed_data/L2Arctic
preprocessor		preprocessor
text		text
utils		utils
README.md		README.md
cvae.py		cvae.py
dataset.py		dataset.py
demo_generator.ipynb		demo_generator.ipynb
evaluate.py		evaluate.py
gmvae.py		gmvae.py
index.html		index.html
metadata.csv		metadata.csv
plot_embs.py		plot_embs.py
preprocess.py		preprocess.py
reqs.txt		reqs.txt
requirements.txt		requirements.txt
schematic.png		schematic.png
synthesize.py		synthesize.py
synthesize_debug.py		synthesize_debug.py
train.py		train.py
train_debug.py		train_debug.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CVAE-Tacotron

Training

Inference

About

Releases

Packages

Languages

AMAAI-Lab/CVAE-Tacotron

Folders and files

Latest commit

History

Repository files navigation

CVAE-Tacotron

Training

Inference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages