Wav2Vec-Wrapper

An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.

Pretrained Models and Reproducibility

Paper	Description	Instructions
CORAA	Checkpoints for the paper: "CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese". More details here	link
SE&R-Challenge	Fine-tuning instructions for the ASR for Spontaneous and Prepared Speech, and Speech Emotion Recognition Shared task. More details here	link
YourTTS2ASR	Checkpoints for the paper: "ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion". More details here	link

Installation

Clone the repository and install the Python dependencies:

git clone https://github.com/Edresson/Wav2Vec-Wrapper
cd Wav2Vec-Wrapper
pip3 install -r requeriments.txt

Install Flashlight dependencies to use KenLM

Use Docker:

In the Wav2Vec-Wrapper repository execute:

nvidia-docker build ./ -t huggingface_flashlight

Now look up the ID of the Docker image you just built:

docker images

Replace IMAGE_ID with that ID and run:

nvidia-docker run  --runtime=nvidia -v ~/:/mnt/ --rm  --shm-size=1g --ulimit memlock=-1 --ulimit stack=67108864 --name wav2vec-wrapper -it IMAGE_ID bash

Manual installation:

Please follow the installation instructions in the Flashlight documentation.

Inference

You can run inference on a folder of wav files by running:

python3 test.py --config_path ./example/config_eval.json --checkpoint_path_or_name facebook/wav2vec2-large-xlsr-53-french --audio_path ../wavs/ --no_kenlm
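
Wav2Vec 2.0 checkpoints such as facebook/wav2vec2-large-xlsr-53-french expect 16 kHz audio, so it can help to check the folder before running the command above. The following is a minimal sketch using only the Python standard library. The function name find_wavs_with_unexpected_rate and its expected_rate parameter are hypothetical and not part of this repository, and whether test.py resamples audio itself is not stated here.

```python
import wave
from pathlib import Path


def find_wavs_with_unexpected_rate(folder, expected_rate=16000):
    """Return (file name, sample rate) for each .wav whose rate differs.

    Hypothetical helper, not part of Wav2Vec-Wrapper. The standard-library
    wave module only reads PCM-encoded WAV files; other encodings raise
    wave.Error.
    """
    mismatched = []
    for path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(path), "rb") as wav_file:
            rate = wav_file.getframerate()
        if rate != expected_rate:
            mismatched.append((path.name, rate))
    return mismatched
```

Calling find_wavs_with_unexpected_rate("../wavs/") returns an empty list when every file is already at 16 kHz. Any file it reports may need resampling before inference.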

To run inference with a KenLM language model, specify the appropriate paths in the config file and remove the --no_kenlm flag.

To generate the lexicon.lst file, you can use the ./utils/generate-vocab.ipynb notebook on your corpus.
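
For orientation, here is a rough sketch of what a lexicon file for a character-level model typically looks like: one line per word, a tab, then the word spelled out as space-separated characters followed by a word-boundary token. This is not the notebook's code. The function names, the lowercasing step, and the "|" boundary token are assumptions and should be matched to the vocabulary of your checkpoint.

```python
from pathlib import Path


def build_lexicon(transcripts, word_boundary="|"):
    """Return sorted lexicon lines of the form 'word<TAB>w o r d |'.

    Hypothetical helper: lowercasing and the '|' boundary token are
    assumptions, not taken from utils/generate-vocab.ipynb.
    """
    words = set()
    for line in transcripts:
        words.update(line.lower().split())
    return [f"{word}\t{' '.join(word)} {word_boundary}" for word in sorted(words)]


def write_lexicon(transcripts, out_path="lexicon.lst"):
    """Write the lexicon to out_path, one entry per line."""
    lines = build_lexicon(transcripts)
    Path(out_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
```

For example, build_lexicon(["olá mundo"]) yields the two entries "mundo<TAB>m u n d o |" and "olá<TAB>o l á |", where <TAB> stands for a tab character.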
