This repository provides an unofficial implementation of the speech restoration model Miipher, originally proposed by Koizumi et al. (arXiv). Please note that the model provided in this repository does not represent the performance of the original model by Koizumi et al., as this implementation differs from the paper in many ways.
Install with pip. The installation has been confirmed on Python 3.10.11:

```sh
pip install git+https://github.com/CShulby/miipher
```
The pretrained model is trained on the LibriTTS-R and JVS corpora and is provided under the CC-BY-NC-2.0 license.
```sh
python run_miipher.py
```
You can also run in parallel on CPU with the following script, passing a list of wav files (note that each wav file should have a corresponding transcription in the same folder):

```sh
python run_miipher_parallel.py --wav_list wav_list
```
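As a sketch of how the input might be prepared (the exact `wav_list` format is an assumption here: a plain-text file with one wav path per line, and transcriptions assumed to be `.txt` files sharing the wav's basename):

```shell
# Hypothetical layout: each utterance is a .wav plus a .txt transcription
# with the same basename, in the same folder.
mkdir -p demo_corpus
touch demo_corpus/utt1.wav demo_corpus/utt1.txt
touch demo_corpus/utt2.wav demo_corpus/utt2.txt

# Build wav_list: one path per line.
find "$PWD/demo_corpus" -name '*.wav' | sort > wav_list

# Flag any wav that is missing its transcription before running inference.
while read -r wav; do
  [ -f "${wav%.wav}.txt" ] || echo "missing transcription for $wav"
done < wav_list
```

Adjust the transcription extension and naming scheme to whatever your data actually uses.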
If you are still hungry for more, you can run the same way with full GPU inference:

```sh
python run_miipher_gpu.py --wav_list wav_list
```
Tests on an RTX 4090 showed roughly 3.5x real time with the parallel CPU script vs. 30x real time on GPU.
| | Original paper | This repo |
|---|---|---|
| Clean speech dataset | proprietary | LibriTTS-R and JVS corpus |
| Noise dataset | TAU Urban Audio-Visual Scenes 2021 dataset | TAU Urban Audio-Visual Scenes 2021 dataset and Slakh2100 |
| Speech SSL model | W2v-BERT XL | WavLM-large |
| Language SSL model | PnG BERT | XPhoneBERT |
| Feature cleaner building block | DF-Conformer | Conformer |
| Vocoder | [WaveFit](https://arxiv.org/abs/2210.01029) | HiFi-GAN |
| X-Vector model | Streaming Conformer-based speaker encoding model | speechbrain/spkrec-xvect-voxceleb |
Code in this repo: MIT License

Weights on Hugging Face: CC-BY-NC-2.0 license