A Synthetic Corpus Generation Method for Neural Vocoder Training

📃[Paper] 💿[Demo]

Abstract: Nowadays, neural vocoders are preferred for their ability to synthesize high-fidelity audio. However, training a neural vocoder requires a massive corpus of high-quality real audio, and the audio recording process is often labor-intensive. In this work, we propose a synthetic corpus generation method for neural vocoder training, which can easily generate synthetic audio with an unlimited number at nearly no cost. We explicitly model the prior characteristics of audio from multiple target domains simultaneously (e.g., speeches, singing voices, and instrumental pieces) to equip the generated audio data with these characteristics. And we show that our synthetic corpus allows the neural vocoder to achieve competitive results without any real audio in the training process. To validate the effectiveness of our proposed method, we performed empirical experiments on both speech and music utterances in subjective and objective metrics. The experimental results show that the neural vocoder trained with the synthetic corpus produced by our method can generalize to multiple target scenarios and has excellent singing voice (MOS: 4.20) and instrumental piece (MOS: 4.00) synthesis results.

Installation

You can simply install packages by PyPI with the following command:

pip install -r requirements.txt

Quick Start

Start to generate a synthetic corpus with 100 pieces of audio

python corpus_generate.py

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
README.md		README.md
corpus_generate.py		corpus_generate.py
requirements.txt		requirements.txt
synthetic_corpus.py		synthetic_corpus.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Synthetic Corpus Generation Method for Neural Vocoder Training

📃[Paper] 💿[Demo]

Installation

Quick Start

About

Releases

Packages

Languages

zerlinwang/synthetic-corpus-vocoder

Folders and files

Latest commit

History

Repository files navigation

A Synthetic Corpus Generation Method for Neural Vocoder Training

📃[Paper] 💿[Demo]

Installation

Quick Start

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages